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METHOD OP ELIMINATTTM/- 

This application is a continuation-in-part of 
Serial No. 07/858,747, £i led mrch * 



TECHNIHAT. T?Tr. T| p 

the stabij^ "ir" 0 - 1 rSlaCes c ° -thod. of increasing 
stabrlrty and/or utilization of a mR„ a produced bv ! 
9 ana by stating regulatory or inhibitory/iLL^y 
sequences (INS) in t-v,^ „ - • y/inscaumty 

prevent or IZ * ° f the * ene 

prevent or reduce expression • 

to constructs inclu^ ^ ^ventxon also relates 

cruets, including expression vectors cont-*^™ 
genes mutated in accord^nc ' con taimng 

cells co ^ 3i • acc °rdance with these methods and host 
cells containing these constructs. 

The methods of th<=> i m m«*' 

useful for increasing JtL^ZZ 

a mRNA without changing its protein coding °' 

ZllToTTt " e USe£Ul aUO " in9 « '-Sas :r t he 
^-^pression of orenpc! U h<„L , . 3 

yenes wnich would otherwise n«*- 
or which would ™« i 1Se not be expressed 

n would be poorly expressed because of t-h~ „ 
of INS reoionc -!„ „ ause of tne presence 

s» regions m the mRNA transcript Thuc t- hfil «. u 
constructs and host cells of th. ■ • methods, 

increasina i-h. invention are useful for 

ncreasing the amount of protein produced by any gene 

which encodes an mRNA transcript which C anZ 

- L i J '- wmcn contains an INS . 
The methods, constructs and host cells of rho 
invention are useful for increasing rt 6 

Produced from genes such in^o^T™ ° f 
fanm-ra ^ ose coding for growth 

na hiv l gag an d en v, for example 

The invention also relates to usinq the 
constructs of the -in^r,-- . 9 cne 

s or the invention m immuno therapy and 

immunoprophylaxis, e.g as a v*^ • 

therapy aftPr ' vaccine, or in genetic 

erapy after expression in humans 
include or be incorporate SUCh c °^tructs can 

incorporated into retroviral or other 
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expression vectors or they may also be directly injected 
into tissue cells resulting in efficient expression of the 
encoded protein or protein fragment. These constructs may 
also be used for in -vivo or in-vitro gene replacement, 
e.g., by homologous recombination with a target gene in- 
5 situ. 

The invention also relates to certain 
exemplified constructs which can be used to simply and 
rapidly detect and/or define the boundaries of 
inhibitory/instability sequences in any mRNA, methods of 

10 using these constructs, and host cells containing these 
constructs. Once the INS regions of the mRNAs have been 
located and/or further defined, the nucleotide sequences 
encoding these INS regions can be mutated in accordance 
with the method of this invention to allow the increase in 

15 stability and/or utilization of the mRNA and, therefore, 
allow an increase in the amount of protein produced from 
expression vectors encoding the mutated mRNA. 

II. BACKGROUND ART 

20 While much work has been devoted to studying 

transcriptional regulatory mechanisms, it has become 
increasingly clear that post- transcriptional processes 
also modulate the amount and utilization of RNA produced 
from a given gene. These post- transcriptional processes 

25 include nuclear post- transcriptional processes (e.g., 
splicing, polyadenylation, and transport) as well as 
cytoplasmic RNA degradation. All these processes 
contribute to the final steady- state level of a particular 
transcript. These points of regulation create a more 

30 flexible regulatory system than any one process could 

produce alone. For example, a short-lived message is less 
abundant than a stable one, even if it is highly 
transcribed and efficiently processed. The efficient rate 
of synthesis ensures that the message reaches the 
35 cytoplasm and is translated, but the rapid rate of 
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degradation guarantees that t-h« mB „» ^ 

- too high a level . 7 ™ lat * 

Proto-oncogenes c-^ ^7 JTs' Z ^ ^ 

exhibit this kind ot"reg!latff ' ■ 

at very low l^./ d ^. UI ^ ^ ™ jessed 

ouicklv S nn *- . rapidly and are modulated 

T fnt and J ranSlent1 ^ different conditions See 

Hent2S ' B ^hixn. Biophys. Acta 1090-281-292 (19^ 
a review. The rate of degradatinnT ' f ° r 

has been shown to be a TZTo n of ^ ° f thSSe ^ 

more instability/i nh ibitorv IT of one or 

itself. y/^hxbxtory sequences within the mRNA 

short 1 • CSllUlar gSneS Wh±Ch enc ° de unstable or 

short-lived mRNAs have been sh™™ - 

INS „ ichin the be 3 e ? :^ n iii°»^* and D " rich 

of the transcript Th«. ^^T^™ T 

genes encoding granulocvte m „„ 1Ular 3enes delude the 

factor .om-j^:: 6 ;™™:^ stimuiacin9 

8 copies of th= == sequences (containing 

conserved Ltl e rrr e ™ "~ TO hi9h ^ 

a Between mice and humans than the protein 

encoding sequences themselves , 93 » versus 65M !r 
and R. Kan.en. Ceil ^,659-667 | USf , * ! ' <G ' Sha "' 
oncogene (c-^, . ^ J^'™ Che *«»»- 
throughout evoiution ,for e^ll IJITL^ f^"™* 1 

«• Cole and S.E. „a ago , Eazy ^ ll'sl ™ ™ ^ T' 
unstable or shnrt- i ^ -st-J-b/ 180 (1990)). other 

or short-lived mRNAs which hsvo 
contain itt ,~i„v. cn nave been shown to 

contain AU-nch sequences within the 3' tjtr 1n „, „ 
interferons (alpha beta delude 

(IL1 IL2 and ™ 9amrna IFNS) '' inter ^ukins 

(XL1 IL2 and IL3); tumor necrosis factor (TNF) • 

lymphotoxin (Lym) , IgGl induction factor ™> ' 
granulocyte colony stimulating factor (G CSf7 ^ 
oncogene (c-^, , « ^ p^o-cj^ "f^" P ^°" 
-d H. Kamen, Cell M:659 . 667 (1986) * ^ ^ S ^w. 

Wisdom and W. Lee. Gen. & Devel 5 . 2 « , 
A- Shyu et al., Gen. & Devel 5-221 231 lio (1991) (c-^,- 

Wilson and R. Treisman IT 'l 221 ' 231 (199l > ; T. 

m -r treisman. Nature 33_£;396-399 MQQjn * . 

Jones a ^M. Cole, Mol. cell Biol 7 !L ^ '* 

1 uiol. 7:4513-4521 (1987) 
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(c-myc) ; V. Kruys et al. f Proc. Natl. Acad. Sci. USA. 
89:673-677 (1992) (TNF) ; D. Koeller et al . , Proc. Natl. 
Acad. Sci. USA. 88:7778-7782 (1991) (transferrin receptor 
(TfR) and c-fos) ; I. Laird-Of f ringa et al., Nucleic Acids 
Res. 1£: 2387-2394 (1991) (c- myc ) ; D. Wreschner and G. 
5 Rechavi, Eur. J. Biochem. 172:333-340 (1988) (which 
contains a survey of genes and relative stabilities) ; 
Bunnell et al.. Somatic Cell and Mol. Genet. 16:151-162 

(1990) (galactosyltransferase-associated protein (GTA) , 
which contains an AU-rich 3' UTR with regions that are 98% 

10 similar among humans, mice and rats) ; and Caput et al. 

Proc. Natl. Acad. Sci. 83:1670-1674 (1986) (TNF, which 

contains a 33 nt AU-rich sequence conserved in toto in the 

murine and human TNF mRNAs) . 

Some of these cellular genes which have been 
15 shown to contain INS within the 3' UTR of their mRNA have 

also been shown to contain INS within the coding region . 

See , e.g., R. Wisdom, and W. Lee, Gen. & Devel. 5:232-243 

(1991) (c- mvc ) ; A. Shyu et al., Gen. & Devel. 5:221-231 
(1991) (c-fos) . 

20 Like the cellular mRNAs, a number of HIV-1 mRNAs 

have also been shown to contain INS within the protein 
coding regions, which in some cases coincide with areas of 
high AU- content. For example, a 218 nucleotide region 
with high AU content (61.5%) present in the HIV-1 gag 
25 coding sequence and located at the 5' end of the gag gene 
has been implicated in the inhibition of gag expression. 
S. Schwartz et al. f J. Virol. 66:150-159 (1992). Further 
experiments have indicated the presence of more than one 
INS in the gag -protease gene region of the viral genome 
30 (see below) . Regions of high AU content have been found 
in the HIV-1 gag/pol and env INS regions. The AUUUA 
sequence is not present in the gag coding sequence, but it 
is present in many copies within gag/pol and env coding 
regions. S. Schwartz et al., J. Virol. 66:150-159 (1992). 
35 see also , e.g., M. Emerman, Cell 57:1155-1165 (1989) (env 
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gene contains both 3 ' UTR and internal 
inhibitory/instability sequences) • r u„ 

Acad ttci iv „ eguences), c. Rosen, Proc. Natl. 

Acad. Sex., USA 85:2071-2075 (1988) (env) • M 

Hadzopoulou-Cladaras et ai t v -, 

, ' J ' Vlr °l- 63:1265-1274 (1989) 

(env); f. Maidarelli et al t v U989) 

/ / ' * Viro1 - £5:5732-5743 (1991) 

(gag/pol); A. Cochrane et al t ^ n . 

(1991) frWM - rQl - ££^5303-5313 

tJ-ysi) (pol) . f. Maidarelli eh a1 

direct analysis of the , . ""^ ^ that the 

context of T ? fUnCtl ° n of ™ regions in the 

prove s repllCati ™P^ent, full-length HIV-i 

proves xs complicated by the fact that the intragenic 
INS are located in the coding sequences of virion 

T h ZTnti:: otei r s ' They further note th - — m 

protein sequences would in most cases affect 

L P iLa n tl s 0 TT es r wel1 ' which in turn couid ^ 

replication of such mutants. 

For examol^^^ 3 re9l ° nS necess *ril y AU - rich . 

For example. tne coding ^ structurallv 

unrelated to the AU-rich 3 ' UTR ins ,a. Shy u et al Gen 
*jr l - 5 : 221 " 231 see parts £ GSn ' 

that this sequence aim. ■ ^ply ln g either 

a tn ,„ q , . S ^ qUence alone « not sufficient to destabilize 

11 ^ P " that meSSagSS als ° contain a 

domxnant stabilizing element ( M . cole and S e m 

Enzyme 44:l 67 - 180 (199on _ * ^ S - E * MRn ^°- 

to specific ™» W * I Interestingly, elements unique 

to specxf xc mRNAs have also been found which can sta hi i 
a mRNA transcript. One example is the ReC respons ^ 

(B Felber 7 of a nttNA transcript 

(1989M " Natl - SCi ' ^ 1 6:i495-l P 99 

r he , Zt ±S iaa ° m Whether the AU sequences 

themselves, and specifically the Shaw Kam.n ^ enCeS 
AUUUA, act as part or all of T ! sequence, 

j j. -j J r 311 of the degradation signal. Nor 

lt: clear whether this -is -. 

er cms xs the only mechanism employed 
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for short-lived messages, or if there are different 
classes of RNAs, each with its own degradative system. 
See . M- Cole and S.E. Mango, Enzyme 44=167-180 (1990) for 
a review; see also . T. Jones and M. Cole, Mol. Cell. 
Biol. 7:4513-4521 (1987) . Mutation of the only copy of 
5 the AUUUA sequence in the c- myc RNA INS region has no 

effect on RNA turnover, therefore the inhibitory sequence 
may be quite different from that of GM-CSF (M. Cole and 
S.E. Mango, Enzyme 44:167-180 (1990)), or else the mRNA 
instability may be due to the presence of additional INS 
10 regions within the mRNA. 

Previous workers have made mutations in genes 
encoding AU-rich inhibitory/ instability sequences within 
the 3' UTR of their transcript mRNAs. For example, G. 
Shaw and R. Kamen, Cell 46:659-667 (1986), introduced a 51 
15 nucleotide AT-rich sequence from GM-CSF into the 3' UTR of 
the rabbit /3- glob in gene. This insertion caused the 
otherwise stable /S -globin mRNA to become highly unstable 
in vivo , resulting in a dramatic decrease in expression of 
/J-globin as compared to the wild- type control. The 
20 introduction of another sequence of the same length, but 
with 14 G's and C's interspersed among the sequence, into 
the same site of the 3' UTR of the rabbit jS-globin gene 
resulted in accumulation levels which were similar to that 
of wild-type 0-globin mRNA. This control sequence did not 
25 contain the motif AUUUA, which occurs seven times in the 

AU-rich sequence. The results suggested that the presence 
of the AU-rich sequence in the 0- globin mRNA specifically 
confers instability. 

A. Shyu et al., Gen. & Devel. 5:221-231 (1991), 
30 studied the AU-rich INS in the 3' UTR of c-fos by 

disrupting all three AUUUA pentanucleotides by single U- 
to-A point mutations to preserve the AU- richness of the 
element while altering its sequence. This change in the 
sequence of the 3' UTR INS dramatically inhibited the 
35 ability of the mutated 3' UTR to destabilize the (8-globin 
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message when inserted into the 3 ' utr of a a? 

as compared to the wild-type INS ? * * 

coding region INS (which ITst , C "** Pr ° tein - 

3' UTR thc ) structurally unrelated to the 

utr INS) was studied by ins P rn n ^ • «. • ^ 
coding region of a a d k inSertln ^ lfc frame into the 
5 „• /J-globxn and observing the effect of 

deletions on the stability of tn- u ertect of 

globin mRNA. 7 ^ heterol ^°us c-Xaa-0- 

Previous workers have also made mutation • 
genes encoding inhibitory/instability mUtatlons ln 

coding region of tw lnStahlllt Y sequences within the 

10 CtoJ-J^J^'^^-"™-" For example, p. 

*fi»1138-li42 ( 9W de Nat1 ' SCi " USA < 

xi42 (1989) demonstrated the presence of a 

negative control reqion th*i- i • * 

region that lies deep in the rnrtin^ 
sequence of the p ^ coding 

„ <«> gene, ^t^r^ 1 ''^ 6 ' ^S^* 
» - cloning of a «^ Z^^S^*"* * 

sequence- (ICS) and observing the effect of th' 
complementary element on geni e xL!f ! ^ 

several sites within ex P^s S1 on when placed at 

&lces within the qnd oens t«v,« r £ 
and double mut a n„ • effect of single 

uouoie mutations introduced intn m„ 
20 element by site-dir^oH synthetic ICS 

expression^ If 7 muta 9 ene ^ on regulation of 

idcz rusion gene cnrroi=i-^ 
ability of the reqrv.nf- correlated with the 

r wj. cne respective mRNAs to fold 
structures that sequester th. Z secondary 
Thus, the gnd geneTlnt " S±te - 

25 to function as a cis-a^LT 

cis -acting antisense RNA 

s^ences 1 J^^T^T <° """" 

fragment carrying the region from - so to .253 , th! T 
re gi on starts at ««, was mutagenized II \h f 9 
frame to lacz. Expression Z^" 63 and th ™ fused in 

plasmas c 0 . ttol ^T.zrbLrr t " 1 *" £rom variant 

ana ly zed. The mutation ! re a u ^TaT ^ 

35 as expected from the mutagenesis oroce, tranSi " ons ' 

agenesis procedures used. Among 
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other mutations, a single base substitution at +253 
resulted in greatly increased expression of the btuB-lacZ 
gene fusion under both repressing and nonrepressing 
conditions . 

R. Wisdom and W. Lee, Gen. & Devel. 5.: 232-243 
5 (1991) , conducted an experiment which showed that mRNA 
derived from a hybrid full length c-myc gene, which 
contains a mutation in the translation initiation codon 
from ATG to ATC, is relatively stable, implying that the 
c- myc coding region inhibitory sequence functions in a 
10 translation dependent manner. 

R. Parker and A. Jacobson, Proc. Natl. Acad. 
Sci. USA 87:2780-2784 (1990) demonstrated that a region of 
42 nucleotides found in the coding region of Saccharomyces 
cerevisiae MATal mRNA, which normally confers low 
15 stability, can be experimentally inactivated by 

introduction of a translation stop codon immediately 
upstream of this 42 nucleotide segment. The experiments 
suggest that the decay of MATal mRNA is promoted by the 
translocation of ribosomes through a specific region of 
20 the coding sequence. This 42 nucleotide segment has a 
high content (8 out of 14) of rare codons (where a rare 
codon is defined by its occurrence fewer than 13 times per 
1000 yeast codons (citing S. Aota et al., Nucl. Acids. 
Res- I£:r315-r402 (1988))) that may induce slowing of 
25 translation elongation. The authors of the study, R. 

Parker and A. Jacobson, state that the concentration of 
rare codons in the sequences required for rapid decay, 
coupled with the prevalence of rare codons in unstable 
yeast mRNAs and the known ability of rare codons to induce 
30 translational pausing, suggests a model in which mRNA 
structural changes may be affected by the particular 
positioning of a paused ribosome. Another author stated 
that it would be revealing to find out whether (and how) a 
kinetic change in translation elongation could affect mRNA 
35 stability (M. Hentze, Bioch. Biophys. Acta 1090:281-292 
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(1991)). R. Parker and A. Jacobson, note, however, that 
the stable PGK1 mRNA can be altered to include up to 40% 
rare codons with, at most, a 3 -fold effect on steady-state 
mRNA level and that this difference may actually be due to 
a change in transcription rates. Thus, these authors 
conclude, it seems unlikely that ribosome pausing per se 
is sufficient to promote rapid mRNA decay. 

None of the aforementioned references describe 
or suggest the present invention of locating 
inhibitory/instability sequences within the coding region 
of an mRNA and modifying the gene encoding that mRNA to 
remove these inhibitory/instability sequences by making 
multiple nucleotide substitutions without altering the 
coding capacity of the gene. 

15 111 - DISCLOSURE OP THE! INVEHTTOW . 

The invention relates to methods of increasing 
the stability and/or utilization of a mRNA produced by a 
gene by mutating regulatory or inhibitory/ instability 
sequences (INS) in the coding region of the gene which 
prevent or reduce expression. The invention also relates 
to constructs, including expression vectors, containing 
genes mutated in accordance with these methods and host 
cells containing these constructs. 

As defined herein, an inhibitory/ instability 
sequence of a transcript is a regulatory sequence that 
resides within an mRNA transcript and is either (l) 
responsible for rapid turnover of that mRNA and can 
destabilize a second indicator/reporter mRNA when fused to 
that indicator/reporter mRNA, or is (2) responsible for 
underutilization of a mRNA and can cause decreased protein 
production from a second indicator/reporter mRNA when 
fused to that second indicator/reporter mRNA or (3) both 
of the above. The inhibitory/instability sequence of a 
gene is the gene sequence that encodes an 

inhibitory/instability sequence of a transcript. As used 
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herein, utilization refers to the overall efficiency of 
translation of an mRNA. 

The methods of the invention are particularly- 
useful for increasing the stability and/or utilization of 
a mRNA without changing its protein coding capacity. 
However, alternative embodiments of the invention in which 
the inhibitory/ instability sequence is mutated in such a 
way that the amino acid sequence of the encoded protein is 
changed to include conservative or non- conservative amino 
acid substitutions, while still retaining the function of 
the originally encoded protein, are also envisioned as 
part of the invention. 

These methods are useful for allowing or 
increasing the expression of genes which would otherwise 
not be expressed or which would be poorly expressed 
15 because of the presence of INS regions in T:he mRNA 

transcript. The invention provides methods of increasing 
the production of a protein encoded by a gene which 
encodes an mRNA containing an inhibitory/instability 
region by altering the portion of the nucleotide sequence 
of any gene encoding the inhibitory/ instability region. 

The methods, constructs and host cells of the 
invention are useful for increasing the amount of protein 
produced by any gene which encodes an mRNA transcript 
which contains an INS . Examples of such genes include, 
for example, those coding for growth factors, interferons, 
interleukins , and the fos proto- oncogene protein, as well 
as the genes coding for HIV-1 gag and env proteins. 

The method of the invention is exemplified by 
the mutational inactivation of an INS within the coding 
region of the HIV-1 gag gene which results in increased 
gag expression, and by constructs useful for Rev- 
independent gag expression in human cells. This 
mutational inactivation of the inhibitory/instability 
sequences involves introducing multiple point mutations 
into the AU-rich inhibitory sequences within the coding 
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region of the gag gene which, due to the degeneracy 

by vectors^onr"^ 0 ' 3 °' inVenti ° n a ~ ~*l±*ied 

tavT^! C ° ntainin ^ the W env, and pel genes which 

invent * " aCC ° rdanCe ^ the me thods of t his 

ZTIZ h ° SC CSllS by human 

HLtat cells containing these vectors. 

The invention also relates to using the 
constructs of the invpnh^n • • 

i cne invention m immunotherapy and 

immunoprophylaxis, e.g as s 

^ as a vaccine, or in genetic 

therapy after expression in humans Sl „h genetlc 
include or be incorporate ■ 7^ constructs can 

oe incorporated into retroviral vectors or 
other expression vectors or- ^ vectors or 

These constructs ray also be used for or in- vitro 

exemnlif inVenti ° n also to certain 

exemplif ied constructs which can be used to simply and 

rapidly detect and/or further defino ,-*, P±y and 

<TlV ,.. ... .. " ner define the boundaries of 

inhibitory/instabilitv so ffl , 0 n^ 0 • 

-r c^xxicy sequences m any mRNA which i = 

further aefine/ high hi ^ ^ 

apacxty of the gene, thereby allowing an increase in rh 

relate / !! stated genes. The invention also 
relates to methods of nqina t-v,~ 

cells e0 nf-, • • 5 6 const ^cts and to host 

cells containing these constructs. 

used to dereL" 113 ^" 5 °' inVenti °° which can be 

to detect -stability/inhibitory regions within an 
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mRNA are exemplified by the vectors, pl9, pl7M1234, 
p37M1234 and p37Ml-10D, which are set forth in Fig. 1. (B) 
and Fig. 6. p37M1234 and p37Ml-10D are the preferred 
constructs, due to the existence of a commercially 
available ELISA test which allows the simple and rapid 
5 detection of any changes in the amount of expression of 
the gag indicator/reporter protein. However, any 
constructs which contain the elements depicted between the 
long terminal repeats in the afore -mentioned constructs of 
Fig. 1. (B) and Fig. 6, and which can be used to detect 

10 instability/ inhibitory regions within a mRNA, are also 
envisioned as part of this invention. 

The existence of inhibitory/ instability 
sequences has been known in the art, but no solution to 
the problem which allowed increased expression of the 

15 genes encoding the mRNAs containing these sequences within 
coding regions by making multiple nucleotide 
substitutions, without altering the coding capacity of the 
gene, has heretofore been disclosed. 

IV, BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1. (A) Structure of the HIV-1 genome- Boxes indicate 
the different viral genes. (B) Structure of the gag 
expression plasmids ( see infra ) . Plasmid pl7 contains the 
complete HIV-1 5' LTR and sequences up to the BssHII 
restriction site at nucleotide (nt) 257. (The nucleotide 
numbering refers to the revised nucleotide sequence of the 
HIV-1 molecular clone pHXB2 (G. Myers et al. f Eds. Human 
retroviruses and AIDS. A compilation and analysis of 
nucleic acid and amino acid sequences (Los Alamos National 
Laboratory, Los Alamos, New Mexico, 1991) , incorporated 
herein by reference) . This sequence is followed by the 
pl7 gaff coding sequence spanning nt 336-731 (represented as 
an open box) immediately followed by a translational stop 
codon and a linker sequence. Adjacent to the linker is 
the HIV-1 3' LTR from nt 8561 to the last nucleotide of 
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the US region. Plasmid pi7R contains in 

nt Styl fragment encompassing ^ - add — n th e 330 
J Virol 64:6010-6017 (19 90 ) ) < ^ et al ' ' 

box) 3' to the P 17- codina (r6PreSented as * -"PPl-d 

« pj-/ coding sequence. The prp < a -, -. 

. by HIV-i sequences from nt 8021 to th. T ^ 
> the US region of the 3' ltr I, " nUCleotide <* 

generated by replacing ^ Pl9 ^ Pl9R -™ 

" «y replacing the HIV-i D i7e«e rnH . n „ 

plasmids pi 7 and m™ coding sequence in 

codina respectively, with the RSV pl9 ~ 

region, indicated by XXX w a „, ? 939 C ° dln3 

seances. ^/^JZ^Z^T^ 
P37M1234 are described ""^ 731 -" 2 *> <«* P^smid 

aescri Ption . z:: :iz: :TV eiow *** in che 

which can be used to det^:^ °* r"™" 

nucleotide seouence encodes eth - * P«t 1C ular 

vector pl7Ml234, which c M f»- instance, 
n i7»«i contains an indicator gene <her» 

P17™, represents the control vector „„,, 

P17MX234 (731-1424, and P37M1234 " 

the nucleotide ^nJlTlllilTZ™ *" ^ 

j-^cerest (here fh© y^o/ig«e ~ -j . 
region) is inserted into t-h- P coding 

eu mto the vector eii-hoy , 
codon of the indi^,-^ either 3' to the stop 

-dm, r, 3lo ::t2?:™:z 3 ;„r ed in tr ™ - 

gag INS and for ft,-i. h . laentif ication of 

ror further mutagenesis. D l7Mi^A 
a vector to insert additional Hiv , Pl?Ml234 was ^ as 
downstream from the codina ^ SegUences 

gene. Three different f ^ ^ » ltered 

numbers were in ser tld V™ 9 "™- -^otide 

below, to genera" 1 / eCt ° r Pl ™ 1234 deS ~ ibed 

generate plasmids P17M1234 (7T, ia<.,, 

P17M1234 ,731-1424, and P17M1234 ,731 2 lfi ^ 

fragments were inserted 3- " IZ ' ' indic "^ 

coding seouence in plTMlL in . ' C ° d ° n ° £ P "~ 

not shown,, pi7„i23!,731 1081, ^ aSsavs <*"» 

WJ1 1081) and pl7M1234f7^i n>,o^ 

expressed high levels of pl7 « protein t* 

p---' protein. i n contrast. 
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pl7M1234 (731-2165) did not express pl7 gag protein, 
indicating the presence of additional INS within the HIV-l 
gag coding region. To generate plasmids pl7M1234 (731- 
1081) NS, p37M1234 and p55M1234, the stop codon at the end 
of the altered pl7 gae gene and all linker sequences in 
5 P 17M1234 were eliminated by oligonucleotide-directed 

mutagenesis and the resulting plasmids restored the gag 
open reading frame as in HIV-l. In expression assays 
(data not shown) p37M1234 expressed high levels of protein 
as determined by western blotting and ELISA assays whereas 
10 p55M1234 did not express any detectable gag protein. 
Thus, the addition of sequences 3 7 to the p24 region 
resulted in the elimination of protein expression, 
indicating that nucleotide sequence 1424-2165 contains an 
INS. This experiment demonstrated that p37M1234 is an 
15 appropriate vector to analyze additional INS. 

Fig. 2. Gag expression from the different vectors. (A) 

HLtat cells were transfected with plasmid pl7, pl7R, or 
pl7M1234 in the absence (-) or presence (+) of Rev (see 

20 infra ) . The transfected cells were analyzed by 

immunoblotting using a human HIV-l patient serum. (B) 
Plasmid pl9 or pl9R was transfected into HLtat cells in 
the absence (-) or presence (+) of Rev. The transfected 
cells were analyzed by immunoblotting using rabbit and 

25 anti-RSV pl9 el& serum. HIV or RSV proteins served as 

markers in the same gels. The positions of pl7 gag and pl9 gag 
are indicated at right. 

Fig. 3. mRNA analysis on northern blots. (A) HLtat cells 
30 were transfected with the indicated plasmids in the 

absence (-) or presence (+) of Rev. 20 fig of total RNA 
prepared from the transfected cells were analyzed (see 
infra ) . (B) RNA production from plasmid pl9 or pl9R was 
similarly analyzed in the absence (-) or presence (+) of 
35 Rev. 
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ThTl!" , NUCle ° tide Se ^-« of the Hiv-i pl7 ~ region 

The locations of the 4 oligonucleotides (M1-M4) 
generate all mutants are underl±ned ^ ^Jj*** t0 

nucleotide substitutions introduced bv Ml . h 
oligonucleotide ^ • „• ° aUCed h y each mutagenesis 

n nucleotide are indicated below the ™rHm~ „ 

Numbering starts from nt + i of th^ V SSqUence - 

iic +i or the viral mRNA. 

, une abse nce of Rev. ni7« 

production was ^00=,^^ i~ ^ 
Fig. 2. 7 lmmunobl °ttin g as described in 



Pi ff- 6. Expression vectors used <„ «. . ^ 

PIS - protease and p SS « proteins. VectQr K J ^ 
further mutagenic using different combinations o t 

^; r e ::::: sio r r ined ~ ^ <* 

found to cont I" JLTol"^"? ^ ™* 

different combinations of oii" did TJT^™' 

in egression, or only partial L " lnCreaSe 

PSSBM1-X0 and pSM-Jo^ 

P55M1-I3pn ™^ • derived from P 37M1-10D. 

P55M1 13P0 contains additional mutations in the aaa sn n 
pol regions included in t-h« „i ■ S 9 and 

mio oligonucleotides Mlioaa 

M12gag, Ml3gag and MOpol The h»t-n>, „ w M11 ^g, 

p ine hatched boxes indicate t-h a 

location of the ^an- «n -"-"axcace tne 

containing ci^t^T^— ""^ *— 

«» seouences. which ^ ^t™. 
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and/or inhibition of the mRNA; and the open boxes 
containing triangles indicate mutated regions containing 
AATAAA sequences, which may contribute to instability 
and/or inhibition of the mRNA. Typical levels of p24 8ag 
expression in human cells after transf ections as described 
5 supra are shown at the right (in pg/ml) . 

Fig. 7. Eukaryotic expression plasmids used to study env 
expression. The different expression plasmids are derived 
from pNL15E (Schwartz, et al. J. Virol. 64:5448-5456 

10 (1990) . The generation of the different constructs is 
described in the text. The numbering follows the 
corrected HXB2 sequence (Myers et al-, 1991, supra ; Ratner 
et al., Hamatol. Bluttransf us . 31:404-406 (1987); Ratner 
et al., AIDS Res. Hum. Retroviruses 3:57-69 (1987); 

15 Solomin, et al. J. Virol. 64:6010-6017 (1990), starting 
with the first nucleotide of R as +1. 5'SS, 5' splice 
site; 3'SS, 3' splice site. 

Fig* 8. Env expression is Rev dependent in the absence of 
20 functional splice sites. Plasmids plSESD- and plSEDSS (C) 
were transfected in the absence or presence of a rev 
expression plasmid (pL3crev) into HLtat cells. One day 
later, the cells were harvested for analyses of RNA and 
protein. Total RNA was extracted and analyzed on Northern 
25 blots (B) . The blots were hybridized with a 

nick- translated probe spanning XhoI-SacI (nt 8443 to 9118) 
of HXB2. Protein production was measured by western blots 
to detect cell -associated Env using a mixture of HIV-l 
patient sera and rabbit anti-gpl20 antibody (A) . 

30 

Fig. 9. Env production from the gpl20 expression plasmids. 

The indicated plasmids were transfected into HLtat cells 
in duplicate plates. A rev expression plasmid (pL3srev) 
was cotransfected as indicated. One day later, the cells 
35 were harvested for analyses of RNA and protein. Total RNA 
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was extracted and analyzed on Northern blots (A) The 
blots were hybridized using a nick- translated probe 
spanning nt 6158 to 7924. Protein production (B) was 
measured by immunoprecipitation after labeling for 5 h 
with 200 mCi/ml of »S-cy S teine to detect secreted 
processed Env (gpi20) . 

Fig. 10. The identification of INS elements within gpi 20 
and gp 4 l using the pl9 (RSV gag) test system. Schematic 
structure of exon 5E containing the env ORP. Different 
fragments (a to G) of the g P 4i portion and fragment H of 
the vpu/gpi20 portion were PGR amplified and inserted into 
the unique EcoRI site located downstream of the RSV gag 
gene in pi 9 . The location of the sequences included in 
the amplified fragments is indicated to the right using 
HXB2R numbering system. Fragments A and B are amplifil 
from pNLlSE and pNLlSEDSS ( in which the spl±ce acc 
sites 7A, 7B and 7 have been deleted) respectively, using 
the same oligonucleotide primers. They are 276 and 234 
nucleotides long, respectively. Fr3Lgment c was 
from pNLlSEDSS as a 323 nucleotide fragment. Fragment F 
' S 3 HpaI ' K P nI restriction fragment of 362 nucleotides 

Fragment E was amplified as a 668 nucleotide f raam ^ I 
r>WT t cmoe ^ ""i-ieociae tragment from 

pNLlSEDSS, therefore the manor c n iin fl ^„ 

5592 nf uyn, u w splice donor at nucleotide 

5592 of HXB2 has been deleted. The rest of the fragments 
were amp f ied from pNL15E as indicated ±n ^ ^ 

dav lat! \T Te tranSfSCted With constructs. One 

was dete ■ „ *** Pl9 ^ Production 

was determined by Western blot analysis using the 

plL^r 9 antib ° dy - ^ eXpreSSi - of Gag from these 
Plasmids was compared to Gag production of pl9 . SA splice 
acceptor; B, Bam* I; H, Hpal; x, Xhol; K, KpnX. The dowT 
regulatory effect of INS contained within the dif f erenT 
fragments is indicated at right. 

Fig * 11 • Tne identification nf ™c -i 

"ncation of INS elements within gpl20 
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and gp41 using the p37Ml-10D (mutant INS p37 m expression 
system) test system. Schematic structure of the env ORF. 

Different fragments (1 to 7) of env were PCR amplified as 
indicated in the figure and inserted into the polyl inker 
located downstream of the p37 mutant gag gene in 
p37Ml-10D. Fragments 1 to 6 were amplified from the 
molecular clone pLW2.4, a gift of Dr. M. Reitz, which is 
very similar to HXB2R. Clone pLW2.4 was derived from an 
individual infected by the same HIV-1 strain IIIB, from 
which the HXB2R molecular clone has been derived* 
Fragment 7 was cloned from pNL43 . For consistency and 
clarity, the numbering follows the HXB2R system. HLtat 
cells were transf ected with these constructs . One day 
later, the cells were harvested and p24 gag production was 
determined by antigen capture assay. The expression of 
Gag from these plasmids was compared to Gag production of 
p37Ml-10D. The down regulatory effect of each fragment is 
indicated at right. 

Fig. 12. Elimination of the negative effects of CRS in 
the pol region. Nucleotides 3700-4194 of HIV-1 were 
inserted in vector p37M1234 as indicated. This resulted 
in the inhibition of gag expression. Using mutant 
oligonucleotides M9pol-M12pol (P9-P12) , several mutated 
CRS clones were isolated and characterized. One of them, 
p37M1234RCRSP10+P12p contains the mutations indicated in 
Fig. 13. This clone produced high levels of gag. 
Therefore, the combination of mutations in 
p37M1234RCRSP10+P12p eliminated the INS, while mutations 
only in the region of P10 or of P12 did not eliminate the 
INS. 

Fig. 13. Point mutations eliminating the negative effects 
of CRS in the pol region (nucleotides 3700-4194) . The 

combination of mutations able to completely inactivate the 
inhibitory/instability element within the CRS region of 



BNSDOCID: <WO 932021 2A1_I_> 



WO 93/20212 




PCT/US93/02908 



- 19 - 



HIV i pol (nucleotides 3700-4194) i- 

spn , lon „ Q . t-Lvt) is shown under the 

sequence m small letters Tv, D c fl 

mi onr>i _ ^ . , p ana M l2pol see Table 2) 

Mi2pol oligonucleotide contains , 
were nnf ^- „ contains additional ntutations that 

were not introduced into P37M1234RCRSP10 + Pi2p (see Fi q 
12), as determined by DNA sequencing. 

Pig. 14. Plasmid nap and nudeotlHo «, 
effinio«i- „ nucleotide sequence of the 

efficient gag expression vector P 37M1-1 0 D. < A) Plasmid 
aap of vector P 37M1-10D The i Plasmid 

10 D i The Plasmid contains a 

pBluescriptKS(-) backbone human a™ ■ 
flanking the htv , genomic sequences 

done, hxv : LTRS a : d e T ce : r found ±n pnl - ~- 

P17 region has L * reg± ° n < Pl? p24) • 

v-<-/ -tegion has been mutaqenizeri no-ir,,, ~ n • 

to M4, and the n 2 4 ™- T ? 0ll ^ucleotides Ml 

15 , . P24 re 3 lon has been mutagenized M i n „ 

oligonucleotides «. M7 , M8 and Mio, as describe! ^ 

r^-i^riT h region for p3? is r l 
zzzzr**- — -i 0 r: f — 

,h« 1V ; ly - The transcribed portions of the LTRs ar P 
shown m black.. tv,~ _ . e iaRs are 



25 



sxiown m black.- The tranqi a n« , 

the end of the p24 Ld" ^ ^ ±nSerted at 

1818. Some restrict^ at P ° Siti °* 

e restriction endonuclease rip avarto ■ *. 

also indicated. (3-D, Complete nZllll T 
P37M1-10D The "P-tete nucleotide sequence of 

P Ml 10D. The ammo acid sequence of the p37 ~ Drot< , 1n 
1S shown under the coding region Svmhnl P«texn 

Numbering starts at the t±~ t \ *=* ah °^' 

at the first nucleotide of the 5 ' LTR . 



30 * mDE§ ^ m -^^m^gT_rHEi Ss ^^ 

general aellZ.TonlTTT" ^ ^ ^ f ° rSg0i ^ 

restrictive of tho -i mr«»,*.^ _ 

tne invention, as claimed. The 

accompanying drawings which ^ • 
35 constitute a part 0 Vl» "eorporated in and 

part of the specification, illustrate an 
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embodiment of the invention and, together with the 
description, serve to explain the principles of the 
invention. 

The invention comprises methods for eliminating 
intragenic inhibitory/instability regions of an mRNA by 
5 (a) identifying the intragenic inhibitory/instability 
regions, and (b) mutating the intragenic 
inhibitory/ instability regions by making multiple point 
mutations. These mutations may be clustered. This method 
does not require the identification of the exact location 
10 or knowledge of the mechanism of function of the INS. 
Nonetheless, the results set forth herein allow the 
conclusion that multiple regions within mRNAs participate 
in determining stability and utilization and that many of 
these elements act at the level of RNA transport, 
15 turnover, and/or localization. Generally, the mutations 
are such that the amino acid sequence encoded by the mRNA 
is unchanged, although conservative and non- conservative 
amino acid substitutions are also envisioned as part of 
the invention where the protein encoded by the mutated 
20 gene is substantially similar to the protein encoded by 
the non-mutated gene. 

The nucleotides to be altered can be chosen 
randomly, the only requirement being that the amino acid 
sequence encoded by the protein remain unchanged; or, if 
25 conservative and non- conservative amino acid substitutions 
are to be made, the only requirement is that the protein 
encoded by the mutated gene be substantially similar to 
the protein encoded by the non-mutated gene. 

If the INS region is AT rich or GC rich, it is 
30 preferable that it be altered so that it has a content of 
about 50% G and C and about 50% A and T. If the INS 
region contains less -preferred codons, it is preferable 
that those be altered to more -preferred codons. If 
desired, however (e.g., to make an A and T rich region 
35 more G and C rich) , more -preferred codons can be altered 
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to less -preferred codons. if the INS 



unserved nucleotides 
could be altered t 



region contains 



o non 



only requirement is that the _ 
by the protein remain unchanged 
non- conservative amino acid substitutions' 



some of those conserved nucleotides 
-conserved nucleotides. Again, the 



amino acid sequence encoded 

conservative and 



or, if 



the only requirement is that 

mutated gene be substantially similar to the prot 
encoded by the non-mutated P 



are to be made 
the protein encoded by the 



ein 



gene 



As used herein 
evolutionarily conserved 
since this conservation may 
part of a signal involved in the'inhlb 
determination. 



conserved nucleotides means 
nucleotides for a given gene, 
reflect the fact that they 

tory/instability 

be 

about the gene of 



are 



determined , Conse ™* nucleotides can generally 

interlT ^ blish ^ Terences about the gene 

interest or can be determined by using a variety of 



computer programs availabl 



e to 



Less- 



practitioners of the art 



— — ' c Aicr ax 

preferred and more -preferred * 
various organisms can be determined / 
charts, such „ e ^termmed from codon usage 



20 Nucl. Acids Res 



such as those se»t- fnri-v, • ™ 

set forth xn T. Maruyama 

1 H O Dor, -I jt ** 



et al 



al., Nucl. Acids. Res 



i£:z*15l-rl97 [igflci -,r,^ • 

iiyb6j and xn s. Aota 



of a 

Patent No. 5 



I£:r315-r402 (1988) 



et 



computer program, such as that discl 
~ 082,767 entitled "Codon Pai 



or 



through 



use 



issued to G. w. Hatfield et al 



osed in U.S. 
r Utilization 



II 



XS 1 



ncorporated herein by reference. 



on January 21, 1992, which 



Generally, the method 



carried out as follows 



of the invention 



is 
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--^fat^tcfal^^ 

is usually Proportional t: tne " :: i U a s ar Pr r ein " 
-ttNA which encodes it Th„. CyC ° PlaSmc Ievel °« the 

containing an ^oj^^^ " — ^ 
-» or protein is either not ^e^lrZlsSl 
expressed poorly as compared to the leve'l Tl^tZ^ ot 
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a reference mRNA or protein under the control of the same 
or similar strength promoter. Differences in the steady 
state levels of a particular mRNA (as determined, for 
example, by Northern blotting) , when compared to the 
steady state level of mRNA from another gene under the 
5 control of the same or similar strength promoter, which 

cannot be accounted for by changes in the apparent rate of 
transcription (as determined, for example, by nuclear run- 
on assays) indicate that the gene is a candidate for an 
unstable mRNA. In addition or as an alternative to being 
10 unstable, cytoplasmic mRNAs may be poorly utilized due to 
various inhibitory mechanisms acting in the cytoplasm . 
These effects may be mediated by specific mRNA sequences 
which are named herein as "inhibitory sequences". 

Candidate mRNAs containing 
15 inhibitory/instability regions include mRNAs from genes 
whose expression is tightly regulated, e.g., many 
oncogenes, growth factor genes and genes for biological 
response modifiers such as interleukins . Many of these 
genes are expressed at very low levels, decay rapidly and 
20 are modulated quickly and transiently under different 

conditions. The negative regulation of expression at the 
level of mRNA stability and utilization has been 
documented in several cases and has been proposed to be 
occurring in many other cases. Examples of genes for 
25 which there is evidence for post- transcriptional 

regulation due to the presence of inhibitory/ instability 
regions in the mRNA include the cellular genes encoding 
granulocyte -monocyte colony stimulating factor (GM-CSF) , 
proto- oncogenes c- myc . c- myb . c-sis, c-fos; interferons 
30 (alpha, beta and gamma IFNs) ; interleukins (IL1, IL2 and 

IL3); tumor necrosis factor (TNF) ; lymphotoxin (Lym) ; IgGl 
induction factor (IgG IF) ; granulocyte colony stimulating 
factor (G-CSF) ; transferrin receptor (TfR) ; and 
galactosyltransferase-associated protein (GTA) ; HIV-1 
35 genes encoding env, gag and pol; the EL. coli genes for 6- 
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phosphogluconate dehydrogenase ,gnd> and btuB- and th 
yeast gene for MftTal (s „ Eh . . UB ' 3,13 che 

ne discussion in the 
Background Art- section, above) Tha 

cellular proto- oncogenes c-^ and ^J*™ «*• 

; ^t gene for and the^C > " " the 

£L£ T " hiCh the " iS fil ter ^ 

within the non- coding region ' 0ry/lMt:ablllt y r ^ions 
of encoding mRNAs co ' encoding or suspected 

ng mRNAs containing inhibitory/instabil i «-„ 

~J^r — ^ -"^relevant 

ut iliZ ed jsrjrr^rsrr unscabie or pooriy 

« can he «^J^^JZ£Z! " 

experxments (i e ^ cnase 

radioactive .U^^T nSKly s™ 1 ^ «»- »"h a 
radiolabeled J£ 11 \T r 9 ^ ° f C1 * 

introducing in^tro ra na ° f ^ ' ° r * 

(either by mllrotStt * *** ^ tar3et cel1 * 

p-cipitaLon : e :: P ::;ti c o alciu,n phosphate c °- 
«- «, to .onitorizL T ods knoun in 

■« Population; or by e^retstna tH **" 
fro. a propter „hich cal^b ^ rjTVT* 
transcription soon after ind,,^ ShUCS ° ff 

o f the ^r: hi h :: s :y;thT es d ti,,atin9 che 

short transcriptional burst- or bHl w ^ 
Pharmacologically (9 g ^' * * bl °<*«3 transcription 

the decay of the partLi ACtln ° mycin D > «* following 

e Particular mRNA at V aHn,,o 
after the addition of t-H» „ various time points 

ion of the drug by Northern hi . 
protection (e o si „„„i » flern Wotting or rna 

-ove deteJinltil I tl ^ /T^ "» 
M.w. Hentze et al B1 J h "tablrshed. sgg, e .g., 

and reference^^ tl^ — 
S. Schwartz et al t v ■> s±so, 

useful ^easur^ent'is hl:tch '"T " "» ~ e 

protein is produced, 
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because this includes all possible INS mechanisms. 
Examples of various mRNAs which have been shown to contain 
or which are suspected to contain INS regions are 
described above. Some of these mRNAs have been shown to 
have half -lives of less than 30 minutes when their mRNA 
5 levels are measured by Northern blots (see, e.g., D. 

Wreschner and G. Rechavi, Eur. J. Biochem. 172:333-340 
(1988) ) . 

2. Localization of Instability De terminants 
10 When an unstable or poorly utilized mRNA has 

been identified, the next step is to search for the 
responsible (cis-acting) RNA sequence elements. Detailed 
methods for localizing the cis -acting 

inhibitory/instability regions are set forth in each of 
15 the references described in the "Background Art" section, 
above, and are also discussed infra. The exemplified 
constructs of the present invention can also be used to 
localize INS (see below) . Cis acting sequences 
responsible for specific mRNA turnover can be identified 
20 by deletion and point mutagenesis as well as by the 

occasional identification of naturally occurring mutants 
with an altered mRNA stability. 

In short, to evaluate whether putative 
regulatory sequences are sufficient to confer mRNA 
25 stability control, DNA sequences coding for the suspected 
INS regions are fused to an indicator (or reporter) gene 
to create a gene coding for a hybrid mRNA. The DNA 
sequences fused to the indicator (or reporter) gene can be 
cDNA, genomic DNA or synthesized DNA. Examples of 
30 indicator (or reporter) genes that are described in the 
references set forth in the "Background Art" section 
include the genes for neomycin, /3-galactosidase, 
chloramphenicol actetyltransf erase (CAT) , and lucif erase, 
as well as the genes for j8- glob in, PGK1 and ACT1. See 
35 also Sambrook et al., Molecular Cloning. A Laboratory 
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Manua 1 f 2d. ed r^i * o 

Harbor, Ne „ York 19e9 P r n9 Cold Spring 

"Men can be used as T 16 ' 56 -"- 67 - genes 

< i.e.. the gag gL o£ 1 B 9eneS ^ diSCl ° Sed 
. » inhibitoryji^ y reZra^T 3 '"^ ^ 

' hiv-! gag genes of construct^™"' \ ina ™« 
P37 M1 - 10D , which have been ™tate" \ P " 4 3nd 

inh ibicory/instabil ni^zT ° : nacci -" «*• 

aspect of tne in Venci L. e9 : n ge a n nd -- h .- stitute one 

encoding a m*N A which is ^l^ ^ZT "* »"» 
relatively high levels h erf a S b eXpressed « 
enough or expressed at hioh " S SCable 

decrease inThe level of L * ^ S ° ^ 

be detected by standard ^ " -»««<» **o<=.±» can 

indicator or r^T^Tl ^ «- " « 

P37MX234 and p3 7M1 . 10D 9 ^^^e " "* C< »"™« 

preferred for reasons 6t " ^ 

° £ create hybrid gen es usin g the e consT^^ 3 
testing the exr,r-«o«, • constructs and 

constructs a™ s'tl T ^ Ch « e 

e axso set forth below 

«- mSNAs ^S l ^'^ Ulty '«V« Of 

*ene s consisting of tL Z^^Z^^*™* 
sequences suspected of encoding an INS 1 
by transfecting the hybrid genls L! „ ! 

appropriate for the .xnr». CeUs " hlch « e 

uxie expression vector* no<=^ *. 

express the mRNAs. The reB11K * t0 Cl ° ne and 

determined by standard t 9 ° f are 

"y standard methods of dPtom,^' 

stability, e . g . Northem S1 r 9 ""^ 

and the resulting levels „, "«PPing or PGR methods, 

quantitated by protein " PrOCein Pr ° dUCed are 

i™noprecipiLL:rand~e 3 r:T:' ^ " ^ 
inhibitory/instability Li^T 

-re than one, „in be' ££ f ~ ^ " ™ 
protein expression and/or stab, f . Urease in the 

compared to the control • ° £ the hybrid mRNA as 

control indicator mRNA. Note that if the 
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ultimate goal is to increase production of the encoded 
protein, the identification of the INS is most preferably 
carried out in the same host cell as will be used for the 
production of the protein. 

Examples of some of the host cells that have 
5 been used to detect INS sequences include somatic 

mammalian cells, Xenoous oocytes, yeast and E. coli. See, 
e.g., G. Shaw and R. Kamen, Cell 46:659-667 (1986) 
(discussed supra ) which localized instability sequences in 
GM-CSF by inserting putative inhibitory sequences into the 
10 3' utr of the j3-globin gene, causing the otherwise stable 
0-globin mRNA to become unstable when transfected into 
mouse or human cells. See also I. Laird-Of f ringa et al., 
Nucleic Acids Res. 19:2387-2394 (1991) which localized 
inhibitory/instability sequences in c-rn^c using hybrid c- 
15 myc - neomycin resistance genes introduced into rat 

fibroblasts, and M. Lundigran et al., Proc. Natl. Acad. 
Sci. USA 88:1479-1483 (1991) which localized 
inhibitory/instability sequences in btuB gene by using 
hybrid btuB-lacZ genes introduced into EL_ coli. For 
20 examples of reported localization of specific 

inhibitory/ instability sequences within a transcript of 
HIV-1 by destabilization of an otherwise long-lived 
indicator transcript, see , e.g., M. Emerman, Cell 57:1155- 
1165 (1989) (replaced 3' UTR of env gene with part of HBV 
25 and introduced into COS-1 cells); S. Schwartz et al . , J. 
Virol. 66:150-159 (1992) (gag gene fusions with Rev 
independent tat reporter gene introduced into HeLa cells) ; 
F. Maldarelli et al., J. Virol. 65^:5732-5743 (1991) 
(gag/pol gene fusions with Rev independent tat reporter 
30 gene or chloramphenicol acetyltransf erase (CAT) gene 

introduced into HeLa and SW480 cells) ; and A. Cochrane et 
al., J. Virol. 65:5303-5313 (1991) (pol gene fusions with 
CAT gene or rat proinsulin gene introduced into COS-1 and 
CHO cells) . 

35 It is anticipated that in vitro mRNA degradation 
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systems (e.g., crude cytoplasmic extracts) to assay mRNA 
turnover in vifra will complement ongoing in vivo analyses 
and help to circumvent some of the limitations of the in 
viyo systems. See M.W. Hentze et al, , Biochim. Biophys . 
Acta 1090:281-292 (1991) and references cited therein. 
See also D. Wreschner and G. Rechavi, Eur. J. Biochem. 
172:333-340 (1988), which analyzed exogenous mRNA 
stability in a reticulocyte lysate cell -free system. 

In the method of the invention, the whole gene 
of interest may be fused to an indicator or reporter gene 
and tested for its effect on the resulting hybrid mRNA in 
order to determine whether that gene contains an 
inhibitory/ instability region or regions. To further 
localize the INS within the gene of interest, fragments of 
the gene of interest may be prepared by sequentially 
deleting sequences from the gene of interest from either 
the 5' or 3' ends or both. The gene of interest may also 
be separated into overlapping fragments by methods known 
in the art (e.g., with restriction endonucleases , etc.) 
See, e.g., s. Schwartz et al., J. Virol. 66.-150-159 
(1992) . Preferably, the gene is separated into 
overlapping fragments about 300 to 2000 nucleotides in 
length. Two types of vector constructs can be made. To 
permit the detection of inhibitory/instability regions 
that do not need to be translated in order to function, 
vectors can be constructed in which the gene of interest 
(or its fragments or suspected INS ) can be inserted into 
the 3' UTR downstream from the stop codon of an indicator 
or reporter gene. This does not permit translation 
through the INS. To test the possibility that some 
inhibitory/instability sequences may act only after 
translation of the mRNA, vectors can be constructed in 
which the gene of interest (or its fragments or suspected 
INS) is inserted into the coding region of the 
indicator/reporter gene. This method will permit the 
detection of inhibitory/instability regions that do need 
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to be translated in order to function. The hybrid 
constructs are transfected into host cells, and the 
resulting mRNA levels are determined by standard methods 
of determining mRNA stability, e.g. Northern blots, SI 
mapping or PCR methods, as set forth above and as 
described in most of the references cited in the 
"Background Art" section. See. also . Sambrook et al. 
(1989), supra , for experimental methods. The protein 
produced from such genes is also easily quantitated by 
existing assays, such as ELISAS, immunoprecipitation and 
western blots, which are also described in most of the 
references cited in the "Background Art" section. See 
alsa, Sambrook et al. (1989), supra , for experimental 
methods. The hybrid DNAs containing the 
inhibitory/ instability region (or regions, if there are 
more than one) will be identified by a decrease in the 
protein expression and/or stability of the hybrid mRNA as 
compared to the control indicator mRNA. The use of 
various fragments of the gene permits the identification 
of multiple independently functional 

inhibitory/ instability regions, if any, while the use of 
overlapping fragments lessen the possibility that an 
inhibitory/instability region will not be identified as a 
result of its being cut in half, for example. 

The exemplified test vectors set forth in Fig. 
1. (B) and Fig. 6 and described herein, e.g., vectors 
P17M1234, p37M1234, P37M1-10D and pl9 , can be used to 
assay for the presence and location of INS in various 
RNAs, including INS which are located within coding 
regions. These vectors can also be used to determine 
whether a gene of interest not yet characterized has INS 
which are candidates for mutagenesis curing. These 
vectors have a particular advantage over the prior art in 
that the same vectors can be used in the mutagenesis step 
of the invention (described below) in which the identified 
INS is eliminated without affecting the coding capacity of 
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the gene. 

The method of using these vectors involves 
xntroducing the entire gene, entire cDNA or fragments of 
the gene ranging from approximately 300 nucleotides to 

approximately 2 kilohsespe ■>> _v. 

. . 7 Kinases 3 to the coding region for gag 

protein usxng unique restriction sites which are 
engineered into the vectors. The expression of the gag 
gene in HLtat cells is measured at both the RNA and 
protein levels, and compared to the expression of the 
starting vectors, a decrease in expression indicates the 
presence of INS candidates that may be cured by 
mutagenesis. The method of using the vectors exemplified 
in Fig. ! herein involves introducing the entire gene and 
fragments of the gene of interest into vectors P 17M1234 

P37M1234 and pl9 The s-f 7 « n e * '. 

p ine Slze of the fragments are 

preferably 300-2000 nucleotides long. Plasmid DNA is 
prepared in £. coli and purified by the CsCl method. 

To permit detection of inhibitory/instability 
regions which do not need to be translated in order to 
function, the entire gene and fragments of the gene of 

llTTlT, intr ° dUCed lnt ° VSCt °" P^M1234, p37M1234 or 
P19 3 of the stop codon of the pi7- coding region To 

allow the detection of inhibitory/ instability regions that 
affect expression only when translated, the described 
vectors can be manipulated so that the coding region of 
the entire gene or fragments of the gene of interest are 
fused m frame to the expressed gag protein gene. For 
example, a fragment containing all or part of the coding 
region of the gene of interest can be inserted exactly I 
to the termination codon of the gag coding sequence in 
vector P37M1234 and the termination codon of gag and the 
linker sequences can be removed by oligonucleotide 
mutagenesis in such a wav as t-« ^,o« 

to th* r- M *- * hS gag readin 3 frame 

to the reading frame of the gene of interest. 

RNA and protein production from the two 

expression vectors (e.a n^ivn?^ ■ • 

i e.g. P37M1234 containing the fragment 
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of the gene of interest inserted directly 3' of the stop 
codon of the gag coding region, with the gag termination 
codon intact, and p37M1234 containing the fragment of the 
gene of interest inserted in frame with the gag coding 
region, with the gag termination codon deleted) are then 
compared after transfection of purified DNA into HLtat 
cells.. 

The expression of these vectors after 
transfection into human cells is monitored at both the 
level of RNA and protein production. RNA levels are 
quantitated by, e.g.. Northern blots, SI mapping or PCR 
methods. Protein levels are quantitated by, e.g., western 
blot or ELISA methods. p37M1234 and p37Ml-10D are ideal 
for quantitative analysis because a fast non- radioactive 
ELISA protocol can be used to detect gag protein (DUPONT 
15 or COULTER gag antigen capture assay) . A* decrease in the 
level of expression of the gag antigen indicates the 
presence of inhibitory/instability regions within the 
cloned gene or fragment of the gene of interest. 

After the inhibitory/ instability regions have 
been identified, the vectors containing the appropriate 
INS fragments can be used -to prepare single -stranded DNA 
and then used in mutagenesis experiments with specific 
chemically synthesized oligonucleotides in the clustered 
mutagenesis protocol described below. 
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Mutation of the Inhibitory/ Instability 
Regions to Generate Stable mRNAs 



Once the inhibitory/instability sequences are 
located within the coding region of an mRNA, the gene is 
modified to remove these inhibitory/instability sequences 
without altering the coding capacity of the gene. 
Alternatively, the gene is modified to remove the 
inhibitory/ instability sequences, simultaneously altering 
the coding capacity of the gene to encode either 
35 conservative or non- conservative amino acid substitutions 
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in the method of the invention, the most general 
method of eliminating the INS in th<=> rnrt< general 

s tIie XW£s ln the coding region of the 
gene of interest is by making multiple mutations in the 
INS region of the gene or gene fragments, without changing 
the ammo acid sequence of the protein encoded by the 

^•r;-" C ° nSerVative and non- conservative amino acid 
substitutions are to be made, the only requirement is that 
the protein encoded by the mutated gene be substantially 
similar to the protein encoded by the non-mutated gene 
It is preferred that all of the suspected 
inhibitory/instability regions, if more than one, be 
mutated at once. Later, if desired, each 

xnhibitory/instability region can be mutated separately in 
order to determine the smallest region of the gene that 

ZI JIT mUtated ^ ° rdSr t0 ' ******** * Stable 

The ability to mutagenic long DNA regions at the same 

time can decrease the time and effort needed to produce- 
the desired stable and/or highly expressed mRNA and 
resulting protein. The altered gene or gene fragments 
containing these mutations will then be tested in the 
usual manner, as described above, e.g., by fusing the 
altered gene or gene fragment with a reporter or indicator 

by the 1? f in9 l6Vel ° f ^ ^ Pr ° tein Educed 

by the altered genes after transfection into an 

appropriate host cell. if the level of mRNA and protein 

produced by the hybrid gene containing the altered gene or 

gene fragment is about the same as that produced by the 

th° e Hb c r tr : ct encodin9 oniy the indicat - 

elL ^""^lity regions have been effectively 

eliminated from the gene or gene fragment due to the 
alterations made in the INS. 

noinr , ^ ° f inVention ' "ore than two 

point mutations will be made in the INS region 

Optionally, point mutations may be made in at least about 
10. of the nucleotides in the inhibitory/instability 
region. These point mutations may also be clustered The 
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nucleotides to be altered can be chosen randomly (i.e., 
not chosen because of AT or GC content or the presence or 
absence of rare or preferred codons) , the only requirement 
being that the amino acid sequence encoded by the protein 
remain unchanged; or, if conservative and non- conservative 
5 amino acid substi tut ions are to be made, the only 

requirement is that the protein encoded by the mutated 
gene be substantially similar to the protein encoded by 
the non-mutated gene. 

In the method of the present invention, the gene 

10 sequence can be mutated so that the encoded protein 

remains the same due to the fact that the genetic code is 
degenerate, i.e., many of the amino acids may be encoded 
by more than one codon. The base code for serine, for 
example, is six- way degenerate such that the codons TCT, 

15 TCG, TCC, TCA, AGT, and AGC all code for serine. 

Similarly, threonine is encoded by any one of codons ACT, 
ACA, ACC and ACG. Thus, a plurality of different DNA 
sequences can be used to code for a particular set of 
amino acids . The codons encoding the other amino acids 

20 are TTT and TTC for phenylalanine; TTA, TTG, CTT, CTC, CTA 
and CTG for leucine; ATT, ATC and ATA for isoleucine; ATG 
for methione; GTT, GTC, GTA and GTG for valine; CCT, CCC, 
CCA and CCG for proline; GCU, GCC, GCA and GCG for 
alanine; TAT and TAC for tyrosine; CAT and CAC for 

25 histidine; CAA and CAG for glutamine; AAT and AAC for 
asparagine; AAA and AAG for lysine; GAT and GAC for 
aspartic acid; GAA and GAG for glutamic acid; TGT and TGC 
for cysteine; TGG for tryptophan; CGT, CGC, CGA and CGG 
for arginine; and GGU, GGC, GGA and GGG for glycine. 

30 Charts depicting the codons (i.e., the genetic code) can 
be found in various general biology or biochemistry 
textbooks . 

In the method of the present invention, if the 
portion (s) of the gene encoding the inhibitory/instability 
35 regions are AT- rich, it is preferred, but not believed to 



BNSDOCID: <WO 9320212A1 _l_> 



WO 93/20212 



PCT/US93/02908 



- 33 



15 



25 



be necessary, that most or all of the mutations in the 
inhibitory/instability region be the replacement of a and 
T with G and C nucleotides, making the regions more GC- 
nch, while still maintaining the coding capacity of the 
gene. If the portion (s) of the gene encoding the 
inhibitory/instability regions are GC-rich, it is 
preferred, but not believed to be necessary, that most or 
all of the mutations in the inhibitory/instability region 
be the replacement of G and C nucleotides with A and T 

10 ZT^ 3 ' ^ re9i0nS 1SSS GC " rich ' whil * "ill 

maintaining the codinq caDacit-v ^ 

vxxny capacity of the gene. If the INS 

IZ°VI e±ther AT ' riCh ° r GC - riCh ' ifc ±S most Preferred 
that it be altered so that it has a content of about 50% g 

and C and about 50% A and T. The AT- (or AU- ) content 
(or, alternatively, the GC-content) of an 

inhibitory/instability region or regions can be calculated 
by using a computer program designed to make such 
calculations. Examples of such programs, used to 
determine the AT- richness of the HIV-1 gag 

20 llTtT^T 31311 ^ re9i ° nS ~™ fied *-ein, are the 

Zt T r SXS 96 VAX (UniVerS ^ ° f Wisconsin) 

and the Gene Works Package (Intelligenetics) . 

In the method of the invention, if the INS 

region contains less -preferred corinnc ^ ■ 

* BJ - errea codons, it is preferable 

that those be altered to more -preferred codons if 
desired, however (e.g., to make an AT-rich region more GC- 
nch), more-preferred codons can be altered to less- 
Pref erred codons. it is also preferred, but not believed 

be rLTT^' leSS -^ efer ~ * - -rely used codons 

30 the ! ^ —-preferred codons. Optionally, only 

the most rarely used codons (identified from published 
codon usage tables, such as in T. Manama et al., Nucl. 
Acids Res. 14 (Supp) :rl5l-iQ7 mqqc\\ 

™* W 97 {1986 >> can be replaced with 

preferred codons, or alternatively, most or all of the 
^ rare codons can be replaced with preferred codons 

Generally, the choice of preferred codons to use will 
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depend on the codon usage of the host cell in which the 
altered gene is to be expressed. Note, however, that the 
substitution of more -preferred codons with less -preferred 
codons is also functional, as shown in the example below. 

As noted above, coding sequences are chosen on 
5 the basis of the genetic code and, preferably on the 
preferred codon usage in the host cell or organism in 
which the mutated gene of this invention is to be 
expressed. In a number of cases the preferred codon usage 
of a particular host or expression system can be 
10 ascertained from available references ( see , e.g., T. 
Maruyama et al., Nucl. Acids Res. l4(Supp) :rl51-197 
(1986) ) , or can be ascertained by other methods ( see , 
e.g., U.S. Patent No. 5,082,767 entitled "Codon Pair 
Utilization", issued to G. W. Hatfield et al. on January 
15 21, 1992, which is incorporated herein by reference) . 
Preferably, sequences will be chosen to optimize 
transcription and translation as well as mRNA stability so 
as to ultimately increase the amount of protein produced. 
Selection of codons is thus, for example, guided by the 
20 preferred use of codons by the host cell and/or the need 
to provide for desired restriction endonuclease sites and 
could also be guided by a desire to avoid potential 
secondary structure constraints in the encoded mRNA 
transcript. Potential secondary structure constraints can 
25 be identified by the use of computer programs such as the 
one described in M. Zucker et al., Nucl. Acids Res. 9:133 
(1981) . More than one coding sequence may be chosen in 
situations where the codon preference is unknown or 
ambiguous for optimum codon usage in the chosen host cell 
or organism. However, any correct set of codons would 
encode the desired protein, even if translated with less 
than optimum efficiency. 

In the method of the invention, if the INS 
region contains conserved nucleotides, it is also 
preferred, but not believed to be necessary, that 
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conserved nucleotides sequences in the 

inhibitory/instability region be mutated. Optionally, at 
least approximately 75% of the mutations made in the 
inhibitory/instability region may involve the mutation of 
conserved nucleotides. Conserved nucleotides can be 
determined by using a variety of computer programs 
available to practitioners of the art. 

In the method of the invention, it is also 
anticipated that inhibitory/instability sequences can be 
mutated such that the encoded amino acids are changed to 
contain one or more conservative or non- conservative amino 
acxds yet still provide for a functionally equivalent 
protein. For example, one or more amino acid residues 
within the sequence can be substituted by another amino 
add of a similar polarity which acts as a functional 
equivalent, resulting in a neutral substitution in the 
amino acid sequence. Substitutes for an amino acid within 
the sequence may be selected from other members of the 
class to which the amino acid belongs. For example, the 
nonpolar (hydrophobic) amino acids include alanine 
leucine, isoleucine, valine, proline, phenylalanine, 
tryptophan and methionine. The polar neutral amino acids 
include glycine, serine, threonine, cysteine, tyrosine 
asparagine, and glutamine. The positively charged (basic) 
amino acids include arginine, lysine and histidine. The 
negatively charged (acidic) amino acids include aspartic 
acid and glutamic acid. 

In the exemplified method of the present- 
invention, all of the regions in the Hiv-i gag gene 
suspected to have inhibitory/instability activity were 
first mutated at once over a region approximately 270 
nucleotides in length using clustered site-directed 
mutagenesis with four different oligonucleotides spanning 
a region of approximately 300 nucleotides to generate the 
construct pl7M1234, described infra, which encodes a 
J 3 stable mRNA. 
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The four oligonucleotides, which are depicted in 
Fig, 4, are 

Ml: ccagggggaaagaagaagtacaagctaaagcacatcgtatgggcaagcagg 
(SEQ ID NO: 6) ; M2 : 

ccttcagacaggatcagaggagcttcgatcactatacaacacagtagc (SEQ ID 

NO: 7) ; M3 : 

accctctattgtgtgcaccagcggatcgagatcaaggacaccaaggaagc (SEQ ID 
NO : 8 ) ; and M4 : 

gagcaaaacaagtccaagaagaaggcccagcaggcagcagctgacacagg (SEQ ID 
NO: 9). These oligonucleotides are 51 (Ml), 48 (M2) , 50 
(M3) and 50 (M4) nucleotides in length. Each 
oligonucleotide introduced several point mutations over an 
area of 19-22 nucleotides ( see infra ) . The number of 
nucleotides 5' to the first mutated nucleotide were 14 
(Ml) 7 18 (M2) ; 17 (M3 ) ; and 11 (M4) ; and the number of 
15 nucleotides 3' to the last mutated nucleotide were 15 

(Ml) ; 8 (M2) ; 14 (M3 ) ; and 17 (M4) . The ratios of AT to 
GC nucleotides present in each of these regions before 
mutation was 3 3AT/18GC (Ml) ; 30AT/18GC (M2) ; 29AT/21GC 
(M3) and 27AT/23GC (M4) . The ratios of AT to GC 
20 nucleotides present in each of these regions after 

mutation was 25AT/26GC (Ml); 24AT/24GC (M2); 23AT/27GC 
(M3) and 22AT/28GC (M4) . A total of 26 codons were 
changed. The number of times the codon appears in human 
genes per 1000 codons (from T. Maruyama et al. f Nuc. Acids 
25 Res. 14 (Supp. ) :rl5l-rl97 (1986)) is listed in parentheses 
next to the codon. In the example, 8 codons encoding 
lysine (Lys) were changed from aaa (22.0) to aag (35.8); 
two codons encoding tyrosine (Tyr) were changed from tat 
(12.4) to tac (18.4); two codons encoding leucine (Leu) 
were changed from tta (5.9) to eta (6.1); two codons 
encoding histidine (His) were changed from cat (9.8) to 
cac (14.3); three codons encoding isoleucine (lie) were 
changed from ata (5.1) to ate (24.0); two codons encoding 
glutamic acid (Glu) were changed from gaa (26.8) to gag 
(41.6); one codon encoding arginine (Arg) was changed from 
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aga 10.8) to cga (5.2, and one codon encoding arginine 
(Arg) was changed fro m agg (11 . 4) to cgg (? one 
encoding asparagine (Asn) was changed from aat (16.9) to 
aac (23.6); two codons encoding glutaxnine (Gin) were 

changed from caa (11.5) to cao t^o i\ 

0) co ca 9 (32.7); one codon encoding 

serine (Ser) was changed from agt (8.7) to tec (18.7). and 
one codon encoding alanine (Ala) was changed from gca' 

(12.7) to gec (29.8) . 

The techniques of oligonucleotide-directed site- 
specific mutagenesis employed to effect the modifications 
« structure or sequence of the DNA molecule are known to 
those of skill in the art. The target DNA sequences which 

are to be mutagenized can be rnwa • ~ 

=> <=<-i uau oe cuna, genomic DNA or 

synthesized DNA seouences r a n^,n 

aT . 0 „, „ . fences. Generally, these DNA sequences 

are cloned into an appropriate vector, e g a 
bacteriophage M13 vector, and single -stranded template DNA 
xs prepared from a plaque generated by the recombinant 

svnthet T' Sin9le - St ~*^ DNA is annealed to the 

synthetic oligonucleotides and the mutagenesis and 
^ subsequent steps are performed by methods well known in 
the art. see, e.g., M . Smith and S. Gillam, in Genetic 
T^ff 2 *-^^ Plenum PresT!^ 

2 48 i,T" ^ ^ KUnkSl ' Pr ° C - Natl - Acad - Sci. USA 
82:488-492(1985). ^ also, Sambrook et al (1989) 

25 o?f DNA 711 ' T thet±C ° li9 ° nucl -tides can be synthesized 
on a DNA synthesizer (e.g., Applied Biosystems) and 
purified by electrophoresis by methods known in the art. 
The length of the selected or prepared 

oligodeoxynucleo.tides using this method can vary There 

are no absolute stt-o i -j™-: 

30 f or wm ■ " mitS - AS a Mtt « ° f convenience, 

ror use in the Droce«?<= o-f • . 

process of this invention, the shortest 

length of the oligodeoxynucleotide is generally 
approximately 20 nucleotides and the longest length is 
generally approximately 60 to l 00 nucleotides. The size 
of the oligonucleotide primers are determined by the 
requirement for stable hybridization of the primers to the 
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regions of the gene in which the mutations are to be 
induced, and by the limitations of the currently available 
methods for synthesizing oligonucleotides. The factors to 
be considered in designing oligonucleotides for use in 
oligonucleotide-directed mutagenesis (e.g., overall size, 
5 size of portions flanking the mutation (s)) are described 
by M. Smith and S. Gillam in Genetic Engineering: 
Principles and Methods , Plenum Press 3:1-32 (1981) . In 
general, the overall length of the oligonucleotide will be 
such as to optimize stable, unique hybridization at the 
10 mutation site with the 5 f and 3' extensions from the 

mutation site being of sufficient size to avoid editing of 
the mutation (s) by the exonuclease activity of the DNA 
polymerase. Oligonucleotides used for mutagenesis in the 
present invention will generally be at least about 20 
15 nucleotides, usually about 40 to 60 nucleotides in length 
and usually will not exceed about 100 nucleotides in 
length. The oligonucleotides will usually contain at 
least about five bases 3' of the altered codons. 

In the preferred mutagenesis protocol of the 
20 present invention, the INS containing expression vectors 

contain the BLUESCRIPT plasmid vector as a backbone. This 
enables the preparation of double -stranded as well as 
single- stranded DNA. Single- stranded uracil containing 
DNA is prepared according to a standard protocol as 
25 follows: The plasmid is transformed into a F' bacterial 
strain (e.g.. DH5aF'). A colony is grown and infected 
with the helper phage M13-VCS [Stratagene #20025; IxlO 11 
pfu/ml] . This phage is used to infect a culture of the E_-_ 
coli strain CJ236 and single -stranded DNA is isolated 
30 according to standard methods. 0.25 ug of single- stranded 
DNA is annealed with the synthesized oligonucleotides (5 
ul of each oligo, dissolved at a concentration of 5 
OD 260 /ml. The synthesized oligonucleotides are usually 
about 40 to 60 nucleotides in length and are designed to 
35 contain a perfect match of approximately 10 nucleotides at 
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each end. They ny contain ag 
wxthxn the remaining 20-40 nucleotides The 
oligonucleotides are designed to cover the region of 
interest and they may be next t-o oa „ u 

hP <,™ k *. Ch ° ther or ther e may 

be gaps between them _ Up fcQ different * 

oligonucleotides have been used at- t-h*. 

He ^ w , . C the same time, althouah 

it is belxeved that the use of more than six th ° U3h 

oligonucleotides at the same time would also wo* in the 
method of this invention. After annealing, elongation 

io :: c co T :t polymerase produces che — -'- does 

not contain uracil. The free ends are ligated using 
usIHo E Thi V £SUl " in ^le-stranded L whlcn L he 
strld „h r T, ** MU S " ain HB1 ° 1 - ™ e «"t.t«. 

strand which does not contain uracil produces double- 
ts nd r ~ N :'i which concaias the int — 

individual colonies are Diek-«=rt -u 

. , ., e P 1CK ea and the mutations are 

a^dit T"** ^ SS< ™ an " ySis - Alternatively or 
useo t ChlS -thod can <and has been! 

used to select for different combinations of 

20 lZTZ le ° Me3 rSSUlt ±n mutant 

Phenotypes. This facilitates the analysis of the regions 
important for function and is helpful in subsequent 
experiments because it allows the analysis of Let 

exetrr r° lved in the ins - in « «- 

« deZ bfdl """^ is °* ^e INS - 1 region of HIV-i 

in Ine step a" 1 "' • ^ *" ^ ™* <° ™ 

in one step a region of 150 nucleotides using three 

but it L e t^ 11,8 UPPSr liBit ° f Chan ^ s - -ot clear. 
30 nucleotide ^ ^ ™ aU * m ° f =00 

ste^sinTth 6 Cha " 9ed ln 2 ° % ° f ChSir ™*-o«— in 
one seep using this protocol. 

The exemplified method of mutating by using 
oligonucleotide-directed sir* ™ • * • using 
vaiH«rt k,, • site -specific mutagenesis may be 

varied by using other methods known in the art. For 
example, the mutated gene can be synthesized directly 
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using overlapping synthetic deoxynucleotides ( see , e.g., 
Edge et al., Nature 292 :756 (1981); Nambair et al., 
Science 223 : 1299 (1984); Jay et al . , J. Biol. Chem. 
259 : 6311 (1984); or by using a combination of polymerase 
chain reaction generated DNAs or cDNAs and synthesized 
oligonucleotides . 



resultant mutated mRNAs can be tested in the same manner 
as the steady state level and/or stability of the 
unmodified mRNA containing the inhibitory/ instability 
regions are tested (e.g., by Northern blotting), as 
discussed in section 1, above. The mutated mRNA can be 
analyzed along with (and thus compared to) the unmodified 
mRNA containing the inhibitory/ instability region (s) and 
with an unmodified indicator mRNA, if desired. As 
exemplified, the HIV-1 pl7 8,lg mutants are compared to the 
unmutated HIV-1 pl7 gag in transfection experiments by 
subsequent analysis of. the mRNAs by Northern blot 
analysis. The proteins produced by these mRNAs are 
measured by immunoblotting and other methods known in the 
art, such as ELISA. See infra . 

VI. INDUSTRIAL APPLICABILITY 

Genes which can be mutated by the methods of 
this invention include those whose mRNAs are known or 
suspected of containing INS regions in their mRNAs. These 
genes include, for example, those coding for growth 
factors, interferons, interleukins , the fos proto- oncogene 
protein, and HIV-1 gag, env and pol, as well as other 
viral mRNAs in addition to those exemplified herein. 
Genes mutated by the methods of this invention can be 
expressed in the native host cell or organism or in a 
different cell or organism. The mutated genes can be 



4. 



Determination of Stability of the 
Mutated mRNA 



The steady state level and/or stability of the 



WO 93/20212 




PCT/US93/02908 



- 41 - 

introduce into a vector such as a pl asmid. cosmid. phage 
-rus or mmi - chronosome and inserted lnto , hosc ; ^ 

r iS " by -«»*■ ~" in the art. In ge „e"al 

genes can be utilized in anv cell 
5 , y cej --i-# either eukarvotir nr 

£TT:- ~ ian <••*•• ^ <• 9 

HeLa) , monkey (e.cr cn*\ r ,u k • _ , y ' 

* ie -9-» cos), rabbit (e.g., rabbit 

reticulocytes), rat, hamster (e g C HO an H v«h 

kidney cells) or mouse cells e T " * 

yeast Cft n, • g * ' Cells) ' P lant cells, 

10 loll) 111' CSllS ° r ^"^^ CS11S *- 

ss iisi™ r: ch can be utnized to *~ 

capable o f Z^^^^^^ » 

-in t-v.^ ' expressing the mutated qenes 

15 Tl current p a T 0r , e=CPreSSed - F ' »»<*«1 " 

Publis h!, rGl " in '" ^uta r B1n W , Greene 

Publxsh,„ 9 Associates and Wiley- mterscience U99 2) and 
Sambrook et al MQflcn ~ -, ' ana 

f« • examples of appropriate vector* 

for various types of host cells The n J vectors 
such genes can be replaced with" . Promoters for 

20 comD a tih lo -.v, replaced Wlth strong promoters 

tZsTZLI ^ int ° Wh±Ch thS *™ is Verted. 

These promoters may be inducible. The host cells 

containing these mutated genes can be used to express 
large amounts of the protein useful in enzyme 
preparations, phantiacput-iraio ^- 
« vaccines and tLZZcs ™^ 

con s t™ts" n n\\tn^s d aid y T °' inVe " Ci ° n ~ 

vivo or In , . nin9 Sald 3 enes "»y also be used for in- 
or an^a gene rep i acelnent . For — 

which produces an mRNA with an ^mk'* • 
3 0 reoion inhibitory instability 

region can be replaced with a gene that- h ae k 
u. , yexie cnat has been modifier) 

inc ZsTT ° f inVenCi ° n ^» to ultimately ' 

increase the amount of protein expressed. Such ge ne 
include viral ge „es and/or cellular g enes. Such g ene 
35 mi ^t be useful, for example, in the 

develops of a vaccine and/or genetic therapy 
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The constructs and/or proteins made by using 
constructs encoding the exemplified altered gag, env, and 
pol genes could be used, for example, in the production of 
diagnostic reagents, vaccines and therapies for AIDS and 
AIDS related diseases. The inhibitory/ instability 
5 elements in the exemplified HIV-l gag gene may be involved 
in the establishment of a state of low virus production in 
the host. HIV-l and the other lentiviruses cause chronic 
active infections that are not cleared by the immune 
system- It is possible that complete removal of the 
10 inhibitory/instability sequence elements from the 

lentiviral genome would result in constitutive expression. 
This could prevent the virus from establishing a latent 
infection and escaping immune system surveillance. The 
success in increasing expression of pl7 gag by eliminating 
15 the inhibitory sequence element suggests that one could 

produce lentiviruses without any negative elements . Such 
lentiviruses could provide a novel approach towards 
attenuated vaccines - 

For example, vectors expressing high levels of 
20 Gag can be used in immunotherapy and immunoprophylaxis , 
after expression in humans. Such vectors include 
retroviral vectors and also include direct injection of 
DNA into muscle cells or other receptive cells, resulting 
in the efficient expression of gag, using the technology 
25 described, for example, in Wolff et al., Science 247:1465- 
1468 (1990), Wolff et al., Human Molecular Genetics 
l(6):363-369 (1992) and Ulmer et al., Science 259:1745- 
1749 (1993) . Further, the gag constructs could be used in 
trans dominant inhibition of HIV expression after the 
30 introduction into humans. For this application, for 

example, appropriate vectors or DNA molecules expressing 
high levels of pSS^ or p37 gte would be modified to generate 
transdominant gag mutants, as described, for example, in 
Trono et al., Cell 59:113-120 (19 89). The vectors would 
35 be introduced into humans, resulting in the inhibition of 
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HIV production due to the combined mechanisms of gag 
transdominant inhibition and of immunostimulation by the 
produced gag protein. In addition, the gag constructs of 
the invention could be used in the generation of new 
retroviral vectors based on the expression of lentiviral 
gag proteins. Lentiviruses have unique characteristics 
that may allow the targeting and efficient infection of 
non-dividing cells. Similar applications are expected for 
vectors expressing high levels of env. 

Identification of similar inhibitory/ instability 
elements in SIV indicates that this virus may provide a 
convenient model to test these hypotheses. 

The exemplified constructs can also be used to 
simply and rapidly detect and/or further define the 
boundaries of inhibitory/instability sequences in any mRNA 
which is known or suspected to contain such regions, e.g., 
in mRNAs encoding various growth factors, interferons or ' 
interleukins , as well as other viral mRNAs in addition to 
those exemplified herein. 

The following examples illustrate certain 
embodiments of the present invention, but should not be- 
construed as limiting its scope in any way. Certain 
modifications and variations will be apparent to those 
skilled in the art from the teachings of the foregoing 
disclosure and the following examples, and these are 
intended to be encompassed by the spirit and scope of the 
invention. 

EXAMPLE 1 
HIV-1 GAG GENE 
The interaction of the Rev regulatory protein of 
human immunodeficiency virus type 1 (HIV-l) with its RNA 
target, named the Rev- responsive element (RRE) , is 
necessary for expression of the viral structure proteins 
(for reviews see G. Pavlakis and B. Felber, New Biol 
2:20-31 (1990); B. Cullen and W. Greene, Cell 51:423-426 
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(1989) ; and C. Rosen and G. Pavlakis, AIDS J. 4:499-509 

(1990) ) . Rev acts by promoting the nuclear export and 
increasing the stability of the RRE - containing mRNAs. 
Recent results also indicate a role for REV in the 
efficient polysome association of these mRNAs (S. Arrigo 
and I. Chen, Gene Dev. 5:808-819 (1991), D. D'Agostino et 
al., Mol. Cell Biol. 12:1375-1386 (1992)). Since the RRE- 
containing HIV-1 mRNAs do not efficiently produce protein 
in the absence of Rev, it has been postulated that these 
mRNAs are defective and contain inhibitory/ instability 
sequences variously designated as INS, CRS, or IR (M. 
Emerman et al. Cell 57:1155-1165 (1989); S. Schwartz et 
al., J. Virol. 66:150-159 (1992); C. Rosen et al., Proc. 
Natl. Acad. Sci. USA 85:2071-2075 (1988); M. Hadzopoulou- 
Cladaras et al., J. Virol. 63:1265-1274 (1989); F. 

15 Maldarelli et al . , J. Virol. 65:5732-5743 (1991); A. W. 
Cochrane et al., J. Virol. 65:5305-5313 (1991)). The 
nature and function of these inhibitory/instability 
sequences have not been characterized in detail. It has 
been postulated that inefficiently used splice sites may 
be necessary for Rev function (D. Chang and P. Sharp, Cell 
5£:789-795 (1989)); the presence of such splice sites may 
confer Rev- dependence to HIV-1 mRNAs. 

Analysis of HIV-1 hybrid constructs led to the 
initial characterization of some inhibitory/ instability 
25 sequences in the gag and pol regions of HIV-1 (S. Schwartz 
et al., J. Virol. 66:150-159 (1992); F. Maldarelli et al., 
J Virol 65:5732-5743 (1991); A. W. Cochrane et al., J. 
Virol. 65:5305-5313 (1991)). The identification of an 
inhibitory/ instability RNA element located in the coding 
region of the pl7 Bag matrix protein of HIV-i was also 
reported (S. Schwartz et al., J. Virol. 66:150-159 
(1992)) . It was shown that this sequence acted in cis to 
inhibit HIV-1 tat expression after insertion into a tat 
cDNA* The inhibition could be overcome by Rev- RRE, 
demonstrating that this element plays a role in regulation 
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by Rev. 



1 - Pl7 ea 8 expression plasnHrl 

To further study the inhibitory/ instability 
element in pi?**, a pl7^ expression plasmid (pi7, Fig. 1) 
was constructed. The pi7« sequence was engineered to 
contain a translational stop codon immediately after the 
coding sequence and thus could produce only pi 7^ (the 
construction of this plasmid is described below) . The 
major 5' splice site of HIV-l upstream of the gag AUG has 
been deleted from this vector (B. Felber et al., Proc. 
Natl. Acad. Sci. USA 8£: 1495 -1499 (1989)). To investigate 
whether plasmid pi7 could produce pi?" in the absence of 
Rev and the RRE, pl7 was transfected into HLtat cells (S. 
Schwartz et al., J. Virol, 64:2519-2529 (1990)) (see 
below) . These cells constitutively produce HIV-l Tat 
protein, which is necessary for transactivation of the 
HIV-l LTR promoter. Plasmid pi7 was transfected in the 
absence or presence of Rev, and the production of pl7^ 
was analyzed by western immunoblotting. The results 
revealed that very low levels of pi7^ protein were 
produced (Fig. 2A) . The presence of Rev did not increase 
gag expression, as expected, since this mRNA did not 
contain the RRE. Next, a plasmid that contained both the 
2s pl7™ coding sequence and the RRE ( P 17R, Fig. l) was 

constructed. Like pl7, this plasmid produced very low 
levels of pi7^ in the absence of Rev. High levels of 
pl7««* were produced only in the presence of Rev (Fig. 2A) . 
These experiments suggested that an inhibitory/ instability 
element was located in the pl7™ coding sequence. 

Expression experiments using various eucaryotic 
vectors have indicated that several other retroviruses do 
not contain such inhibitory/ instability sequences within 
their coding sequences (see for example, J. Wills et al . . 
J. Virol. 62:4331-43 (1989) and V. Morris et al., J. 
Virol. 62:349-53 (1988)). To verify these results, the 
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pl7 gag (matrix) gene of HIV-1 in plasmid pl7 was replaced 
with the coding sequence for pl9 gag (matrix) which is the 
homologous protein of the Rous sarcoma virus (RSV, strain 
SR-A) . The resulting plasmid, pl9 (Fig. 1) , was identical 
to plasmid pl7, except for the gag coding sequence. The 
production of pl9 gag protein from plasmid p!9 was analyzed 
by western immunoblotting, which revealed that this 
plasmid produced high levels of pl9 fia£ (Fig. 2A) . These 
experiments demonstrated that the pl9 gag coding sequence of 
RSV, in contrast to p!7 eag of HIV-1, could be efficiently 
expressed in this vector, indicating that the gag region 
of RSV did not contain any inhibitory/ instability 
elements. A derivative of plasmid pl9 that contained the 
RRE, named pl9R (Fig. l) was also constructed. 
Interestingly, only very low levels of p!9 gag protein were 
produced from the RRE -containing plasmid p!9R in the 
absence of Rev. This observation indicated that the 
introduced RRE and 3' HIV-l sequences exerted an 
inhibitory effect on pl9 gag expression from plasmid pl9R, 
which is in agreement with recent data indicating that in 
the absence of Rev, a longer region at the 3' end of the 
virus including the RRE acts as an inhibitory/instability 
element (G. Nasioulas, G. Pavlakis, B. Felber, manuscript 
in preparation) . In conclusion, the high levels of 
expression of RSV pl9 gae in the same vector reinforced the 
conclusion that an inhibitory/instability sequence within 
HIV-1 pl7 gag coding region was responsible for the very low 
levels of expression. 

It was next determined whether the 
inhibitory/ instability effect of the p!7 gag coding sequence 
was detected also at the mRNA level. Northern blot 
analysis of RNA extracted from HLtat cells transfected 
with pl7 or transfected with pl7R demonstrated that pl7R 
produced lower mRNA levels in the absence of Rev (Fig. 3 A) 
(See Example 3) . A two- to eight -fold increase in pl7R 
mRNA levels was observed after coexpression with Rev. 



BNSDOCID: <WO 932021 2A1 J_> 



WO 93/20212 



PCT/US93/02908 



- 47 - 



10 



15 



20 



25 



30 



35 



Plasmxd pi7 produced mRNA levels similar to those produced 
by P 17R xa the absence of Rev. Notably, Rev decreased the 
levels of mRNA and protein produced by mRNAs that do not 
contain RRE. This inhibitory effect of Rev in 
cotransfection experiments has been observed for many 
other non-RRE-containing mRNAs, such as luciferase and CAT 
(L. Solomin et al . , j. Virol 64:6010-6017 (1990)- D M 
Benko et al . , New Biol 2:1111-1122 (1990)). These results 
established that the inhibitory element in gag a i so 
affects the mRNA levels and are in agreement with previous 
findings (S. Schwartz et al . , j. virol. 66:150-159 
(1992)) . Quantitations of the mRNA and protein levels 
produced by pi 7 R in the absence or presence of Rev were 
performed by scanning densitometry of appropriate serial 
dilutions of the samples, and indicated that the 
difference was greater at the level of protein (60- to 
100-fold) than at the level of mRNA (2- to 8-fold) This 
result is compatible with previous findings of effects of 
Rev on mRNA localization and polysomal loading of both gag 
and env mRNAs (S. Arrigo et al . , Gene Dev 1:808-819 
(1991); D. D'Agostino et al., Mol . Cell. Biol. 12:1375- 
1386 (1992); M . Emerman et al., Cell 57:1155-1165 (1989)- 
B. Felber et al., Proc . Natl. Acad. Sci. USA 86 : 1495-1499 ' 
(1989), M. Malim et al.. Nature (London) 338:254-257 
(1989)) . Northern blot analysis of the mRNAs produced by 
the RSV gag expression plasmids revealed that pl9 produced 
high mRNA levels (Fig. 3B) . This further demonstrated 
that the P i9« coding sequence of RSV does not contain 
inhibitory elements. The presence of the RRE and 3' HIV-i 
sequences in plasmid pi9R resulted in decreased mRNA 
levels in the absence of Rev, further suggesting that 
inhibitory elements were present in these sequences 
Taken together, these results established that gag 
expression in HIV-i is fundamentally different from that 
m RSV. The HIV-i P i7- coding sequence contains a strong 
inhibitory element while the RSV pigm cod ing sequence 
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does not. Interestingly, plasmid pl9 contains the 5' 
splice site used to generate the. RSV env mRNA, which is 
located downstream of the gag AUG. This 5' splice site is 
not utilized in the described expression vectors (Fig. 
3B) . Mutation of the invariable GT dinucleotide of this 
5 5 f splice site to AT did not affect pl9 gag expression 

significantly (data not shown) . On the other hand, the 
HIV-1 pl7 expression plasmid did not contain any known 
splice sites, yet was not expressed in the absence of Rev. 
These results further indicate that sequences other than 
10 inefficiently used splice sites are responsible for 
inhibition of gag expression. 

2 . Mutated p!7 gag vectors 
To investigate the exact nature of the 
!5 inhibitory element in HIV-1 gag, site -directed mutagenesis 
of the pl7 gag coding sequence with four different 
oligonucleotides, as indicated in Fig. 4, was performed. 
Each oligonucleotide introduced several point mutations 
over an area of 19-22 nucleotides. These mutations did 
20 not affect the amino acid sequence of the pi? 8 ** protein, 
since they introduced silent codon changes. First, all 
four oligonucleotides were used simultaneously in 
mutagenesis using a single -stranded DNA template as 
described (T. Kunkel, Proc. Natl. Acad. Sci. USA 82:488- 
492 (1985); S. Schwartz et al., Mol. Cell. Biol. 12:207- 
219 (1992) ) . This allowed the simultaneous introduction 
of many point mutations over a large region of 270 nt in 
vector pl7. A mutant containing all four oligonucleotides 
was isolated and named p!7M1234. Compared to pl7, this 
plasmid contained a total of 28 point mutations 
distributed primarily in regions with high AU- content. 
The phenotype of the mutant was assessed by transfections 
into HLtat cells and subsequent analysis of pl7 sag 
expression by immunoblotting. Interestingly, pl7M1234 
produced high levels of pl7 gag protein, higher than those 
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produced by pi7R in the presence of Rev (Fig. 2A) This 
result demonstrated that the inhibitory/instability 
signals in pi7- mRNA had been inactivated in plasmid 
P17M1234. As expected, the presence of Rev protein did 
not increase expression from P 17M1234, but instead, had a 
slight inhibitory effect on gag expression. Thus, P i7^ 
expression from the mutant P 17M1234 displayed the same 
general properties as the pl9'«* of RSV, that is, a high 
constitutive level of Rev- independent gag expression. 
Northern blot analysis revealed that the mRNA levels 
produced by P 17M1234 were increased compared to those 
produced by pi7 (Fig. 3A) . 

To further examine the nature and exact location 
of the minimal inhibitory/instability element, the p it« 
coding sequence in plasmid P 17 was mutated with only one 
of the four mutated oligonucleotides at a time. This 
procedure resulted in four mutant plasmids, named pi7Ml 
P17M2, pi7M3, and pi7M4, according to the oligonucleotide 
that each contains. None of these mutants produced 
significantly higher levels of pi7*« protein compared to 
plasmid pi7 (Fig. 5), indicating that the 

inhibitory/instability element was not affected. The pl7 
coding sequence was next mutated with two oligonucleotides 
at a time. The resulting mutants were named P 17M12 
P17M13, P 17M14, pl7M23, pl7M24, andpl7M34. Protein 
production from these mutants was minimally increased 
compared with that from pi 7 . and it was considerably lower 
than that from pl7M1234 (Fig. 5 ) . m addition, a triple 
oligonucleotide mutant, P 17M123, also failed to express 
high levels of pi7- (data not shown) . These findings may 
suggest that multiple inhibitory/instability signals are 
present in the coding sequence of pi 7 - Alternatively a 
single inhibitory/instability element may span a large ' 
region, whose inactivation requires mutagenesis with more 
than two oligonucleotides. This possibility is consistent 
with previous data suggesting that a 218 -nucleotide 
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inhibitory/ instability element in the p!7 gaff coding 
sequence is required for strong inhibition of gag 
expression. Further deletions of this sequence resulted 
in gradual loss of inhibition (S. Schwartz et al., J. 
Virol. 66:150-159 (1992)). The inhibitory/ instability 
element may coincide with a specific secondary structure 
on the mRNA. It is currently being investigated whether a 
specific structure is important for the function of the 
inhibitory/ instability element. 

The pl7 gaff coding sequence has a high content of 
A and U nucleotides, unlike the coding sequence of pl9 gag 
of RSV (S. Schwartz et al., J. Virol. 66:150-159 (1992); 
G. Myers and G. Pavlakis, in The Retroviridae J. Levy, 
Eds. (Plenum Press, New York, NY, 1992), pp. 1-37). Four 
regions with high AU content are present in the pl7 gag 
15 coding sequence and have been implicated in the inhibition 
of gag expression (S. Schwartz et al., J. Virol. 66:150- 
159 (1992) ) . Lentiviruses have a high AU content compared 
to the mammalian genome. Regions of high AU content are 
found in the gag/pol and env regions, while the multiply 
spliced mRNAs have a lower AU content (G. Myers and G. 
Pavlakis, in The Retroviridae . J. Levy, Eds. (Plenum 
Press, New York, NY, 1992), pp. 1-37), supporting the 
possibility that the inhibitory/instability elements are 
associated with mRNA regions with high AU content. It has 
^ J been shown that a specific oligonucleotide sequence, 

AUUUA, found at the AU-rich 3' untranslated regions of 
some unstable mRNAs, may confer RNA instability (G. Shaw 
and R. Kamen, Cell 46:659-667 (1986)), Although this 
sequence is not present in the pl7 Eag sequence, it is found 
in many copies within gag/pol and env regions. The 
association of instability elements with AU-rich regions 
is not universal, since the RRE together with 3' HIV 
sequences, which shows a strong inhibitory/instability 
activity in our vectors, is not AU-rich. These 
observations suggest the presence of more than one type of 
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IZZ ^ SeqU — In ^tion to reducing 

the AU content, some of the mutations introduced in 
plasmid P i7 changed rarely used codons to more favored 
codons for human cells an-hnnrtv, 

M , , . S * Although the use of rare codons 

could be an alternative pxni^naf^^ * 

Live explanation for poor HIV gaq 

^o~fr',r iS tyPS ° £ tranSl «"-l regulation is not 
favored by these results, since the presence of Rev 

corrects the defect in gag expression. In addition, the 
observation that the presence of non- translated sequences 
reduced gag expression (for example, the RRE sequence in 
P17R) . suggests that translation of the 
inhibitory/instability region is not necessary for 

^1?°"' In "° duCCion ° f and 3 • HW sequences in 

P17M1234 was also able to decrease gag expression, 
verxfyxng that independent negative elements not acting 
co-translationally are responsible for poor expression 

3 ' ariH?J" ic ? tion and elimination of 

SCL SI in ** p" 

20 7« „ • T ° e££e0t ° f reraoval ° f IH « in the 

P"« coding region (the p it« coding region spans 

nucleotides 336-731. as described in the description of 
F^g. 1. <B) above, and contains the first of three parts 
(i.e.. P17, P 24. and pis) of the gag coding region, as 
25 .ndrcated on in Fig. (A) and , Qn ^ 

the complete gag gene expression vectors were constructed 
1D " hlCh ad ^"°nal sequences of the gag gene were 
inserted 3- to the mutationals altered p 17 ~ coding 

region, dovrastream of thp> <5t-,->r, 

or tne stop codon, of vector P 17M1234. 

3Q Three vectors containing increasing lengths of gag 

sequences were studied: P 17M1234 (731- 1081) , p 17 M1234 (731- 
1424) and P17M1234 (731-2165), as shown in Fig i (C) 
Levels of expression of pl7 ~ were measured, with the " 
results indicating that region of the mRNA encoding the 

35 second part of the gag protein do 

the p24««* nro tein I- k Part encodin 3 

tne P 24 protein, which spans nucleotides 731-1424) 
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contains only a weak INS, as determined by a small 
reduction in the amount of pl7 gag protein expressed by 
pl7M1234 as compared with the amount of pl7 gag protein 
expressed by pl7M1234 (731-1424) , while the region of the 
mRNA encoding the third part of the gag protein (i.e., the 
part encoding the pl5 gag protein, which spans nucleotides 
1425-2165) contains a strong INS, as determined by a large 
reduction in the amount of gag protein expressed by 
pl7M1234 (731-2165) as compared with the amount of protein 
expressed by p!7M1234 and p!7M1234 (731-1424) . 



4. p37M1234 vector 

The above analysis allowed the construction of 
vector p37M1234, which expressed high levels of p37 gag 
precursor protein (which contains both the pl7 gag and p24 gag 
^ protein regions) . Vector p37M1234 was constructed by 

removing the stop codon at the end of the gene encoding 
the altered pl7 gag protein and fusing the nucleotide 
sequence encoding the p24 gag protein into the correct 
reading frame by oligonucleotide mutagenesis. This 
restored the nucleotide sequence so that it encoded the 
fused pl7 gag and p24 gag protein (i.e., the p37 gag protein) as 
it is encoded by HIV-1. Since the presence of the p3 7 gttg 
or of the p24 g * e protein can be quantitated easily by 
commercially available ELISA kits, vector p37M1234 can be 
used for inserting and testing additional fragments 
suspected of containing INS . Examples of such uses are 
shown below. 



35 



5. Vectors p!7M1234 (731-1081) NS and p55BM1234 
Other vectors which. were constructed in a 
similar manner as was P37M1234 were pl7M1234 (731- 1081) NS 
and p55BM1234 (Fig. l. (C) ) . The levels of gag expression 
from each of these three vectors which allow the 
translation of the region downstream (3') of the pl7 
coding region, was respectively similar to the level of 
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gag expression from the vectors o™-, • • 

sequences t » n vectors containing the nucleotide 

sequences 3 to a stop codon (i.e., vectors P 17M1234 (731- 
1081), P17M1234 (731-1424) and P 17M1234 (731-2165) 
described above). These results al.o h 

INS regions in the gag gen "r T^"?™*'™* «™= th. 

. _ 3 y yene are not affected by 

translation or lack thereof through the INS region These 
results demonstrate the use of pi 7M 1234 to detect 
addxtxonal INS sequences in the Hiv-i gag coding region 

Thus r 1424 " 2165 encoding region of HIV - «>. 

Thus, these results also demonstrate how a gene comLnina 
one or more inhibitory/instability regions can b ZlZsl 

used to further locate additional inhibitory/instabilitv 
regxons within that gene, if any. ^/"stability 



6 - Vectors p^7Ml-mn 3 ^ p gg M i.m 

As described above, experiments indicated the 
presence of INS in the n54 „r,^ , ^ ecl tne 

fc 111 Cne P 24 and PIS region of HIV-i i n 

addxtion to those identifier! =,r„* -, • . 

ra „. ^ r ^aentifxed and elxmxnated in the pi7« 

regxon of HIV-i Th-io ■; ~ ^ p 
*> « on pa S e 7l6 .^ ; ^ ~ - h e^ tic all y in Figure 

< r >. In ^ figure , cgagM ^ s : : c r:r 6 " 7182 

P55BM1234. lcai co 

vectors a n! y rt SCUdying "** eXpressi °" °* W«- Protein i n 
vectors encodxng the d24*"« i->-i-,~.*- = .i „ . 
25 aaa and n«i o P«>texn containing additional 

gag and pol sequences, it was found that vectors that 
contaxned the complete gag gene and part of the pol gene 

g -^ ; r f 5BM1234 ' — Fig. 6, were not expLsse'd at 
h,gh levels, despxte the elimination of INS-I in the pl7 - 

30 ^ th " ribSd ^ The ors have * 

hypothesxzed that this is caused by the presence of 
multxple INS regions able to act i„ rt . P resence of 
other to oT ■ independently of each 

otner. To elxmxnate the additional twq m 
htv t , ^uxtxonai INS, several mutant 

HIV-l olxgonucleotides were constnirh^ , 

con scructea (see Table 9 \ 

xncorporated in various gaq exDrf3Qe! , n 2> and 

5 a = expressxon vectors p n r 

example, oligonucleotides M6gag M 7aaa mr / 

°a^g, M7gag, MSgag and MlOgag 
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were introduced into p37M1234, resulting in p37Ml-10D and 
the same oligonucleotides were introduced into p55BM1234, 
resulting in p55BMl-10. These experiments revealed a 
dramatic improvement of expression of p37 gag (which is the 
pl7 TC and p24 gag precursor) and p55 gag (which is the intact 
5 gag precursor molecule produced by HIV-1) upon the 

incorporation in the expression vectors p37M1234 and 
p55BM1234 of additional mutations contained in the 
oligonucleotides M6gag, M7gag, M8gag and MIOgag (described 
in Table 2) . Fig. € shows that expression was 
10 dramatically improved after the introduction of additional 
mutations . 

Of particular interest was p37Ml-10D, which 
produced very high levels of gag. This has been the 
highest producing gag construct (see Fig. 6) . 
15 interestingly, addition of gag and pol sequences as in 
vectors p55BMl-10 and p55AMl-10 (Fig. 6) reduced the 
levels of gag expression. Upon further mutagenesis, the 
inhibitory effects of this region were partially 
eliminated as shown in Fig. 6 for vector p55Ml-13P0. 
20 Introduction of mutations defined by the gag region 

nucleotides MIOgag, Mllgag, M12gag, M13gag, and pol region 
nucleotide MOpol increased the levels of gag expression 
approximately six fold over vectors such as p55BMl-10. 

The HIV-1 promoter was replaced by the human 
25 cytomegalovirus early promoter (CMV) in plasmids p37Ml-10D 
and p55Ml-13P0 to generate plasmids pCMV r 37Ml-10D and 
pOW55Ml-13P0, respectively. For this, a fragment 
containing the CMV promoter was amplified by PCR 
(nucleotides -670 to +73, where +1 is the start of 
30 transcription, see , Boshart, et al., Cell . 41, 521 

(19 85)) . This fragment was exchanged with the StuI - 
BssHII fragment in gag vectors p37Ml-10D and p55Ml-13P0, 
resulting in the replacement of the HIV-1 promoter with 
that of CMV. The resulting plasmids were compared to 
3^ those containing the HIV-1 promoter after transfection in 
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human cells, and gave similar high expression of gag. 
Therefore, the high expression of gag can be achieved in 
the total absence of any other viral protein. The 
exchange of the.HIV-i with other promoters is beneficial 
xf constitutive expression is desirable and also for 

expression in other mammalian cells s „rh ae 

,. , <~e-ns, such as mouse cells 

in whlch th e HIV-i promoter is wea k. 

The constructed vectors P 37Ml-i 0 D and p55BMl-l 0 
can be used for the Rev independent production of p 3 7- 
and p55~ proteins, respectively. In addition , chese 
vectors can be used as convenient reporters, to identify 
and eliminate additional l NS in different RNA molecules 
have h. U ! lng ^ Pr0t ° COlS Ascribed herein, regions 

Zl of Hx; " With±n 9P41 (the —membrane 

part of HIV-i env) coding area and at the post-env 3' 

region of HIV-i which contain INS Th. ~i • ■ 

uutdin XNS - The elimination of ins 
rrom gag, pol and env recrionc? =1n 

Vi -i /-rVi i . regions will allow the expression of 

high levels of authentic HIV-i statural proteins in the 
absence of the Rev regulatory factor of HIV- 1 . The 
mutated coding sequences can be incorporated into 
appropriate gene transfer vectors which may allow the 
targets of specific celis and/or more efficient gene 

be^ed^fo/d"" 3 "" 1 "' mUCaCed C ° din9 Se ^— — 

be used for direct expression in human or other cells in 

vlCr Q or in viv o with t-ho >~ ■ . — 

h-i„ h J~ 9 being tne Production of 

high protein levels and the aenei-at--i ™ ^ 

" lub generation of a stroncr immune 

response. The ultimate goal in either • I 

. ^ 3 in eitner case is subsequent 

protection from HIV infection and disease. 

inhibir dSSCribed ^riments demonstrate that the 

inhibitory/instability sequences srp ™, • * 
30 htv i ™ sequences are required to prevent 

stLt T SS1 ° n ' ThlS W ° Ck t0 Che session of viral 
structural proteins can be overcome by the Rev-RRE 

interaction. m the absence of INS, HIV-x expression 
would be similar to simpler retroviruses and Tould not 

« ZTIZIT/- ThUS ' 13 a ™^ concent of 

Rev regulation. Sequence comparisons suggest that the l„s 
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element identified here is conserved in all HIV-1 
isolates, although this has not been verified 
experimentally. The majority (22 of 28) of the mutated 
nucleotides in gag are conserved in all HIV-1 isolates, 
while 22 of 28 are conserved also in HIV- 2 (G. Myers, et 
5 al . , Eds . Human retroviruses and AIDS . A compilation and 
analysis of nucleic acid and amino acid sequences (Los 
Alamos National Laboratory, Los Alamos, New Mexico, 1991) , 
incorporated herein by reference) . Several lines of 
evidence indicate that all lentiviruses and other complex 

10 retroviruses such as the HTLV group contain similar INS 
regulatory elements . Strong INS elements have been 
identified in the gag region of HTLV- 1 and SIV (manuscript 
in preparation) . This suggests that INS are important 
regulatory elements, and may be responsible for some of 

15 the biological characteristics of the complex 

retroviruses. The presence of INS in SIV and HTLV- 1 
suggests that these elements are conserved among complex 
retroviruses. Since INS inhibit expression, it must be 
concluded that their presence is advantageous to the 

20 virus, otherwise they would be rapidly eliminated by 
mutations . 

The observations that the inhibitory/instability 
sequences act in the absence of any other viral proteins 
and that they can be inactivated by mutagenesis suggest 

25 that these elements may be targets for the binding of 

cellular factors that interact with the mRNA and inhibit 
post transcriptional steps of gene expression. The 
interaction of HIV-l mRNAs with such factors may cause 
nuclear retention, resulting in either further splicing or 

30 rapid degradation of the mRNAs. It has been proposed that 
components of the splicing machinery interact with splice 
sites in HIV-1 mRNAs and modulated mRNA expression (A. 
Cochrane et al., J. Virol. 65.:5305-5313 (1991); D. Chang 
and P. Sharp, Cell 59:789-795 (1989); X. Lu et al., Proc. 

35 Natl. Acad. Sci. USA 87:7598-7602 (1990)). However, it is 



WO 93/20212 jfe 

PCT/US93/02908 



10 



15 



20 



25 



30 



35 




- 57 - 



not likely that the inhibitory/instability elements 
described here are functional 5' or 3 ' splice sites 
Thorough mapping of HIV-l S pi ice sites performed by 
several laboratories using the Reverse Transcriptase -PGR 
technique failed to detect any splice sites within gag (S 
Schwartz et al. f J. Virol. 64:2519-2529 (1990)- J 
Guatelli et al . , J. Virol . 64:4093-4098 (1990)- E D 
Gerrett et al., j. virol. 65:1653-1657 (1991); M. Robert - 
Guroff et al . , j. virol . 64:3391-3398 (1990); S. Schwartz 
et al., J. virol. 64:5448-5456 (1990); g. Schwartz et 
al., Virology lfL3:677-686 (1991)). The suggestions that 
Rev may act by dissociating unspliced mRNA from the 
splicesomes (D. Chang and P. Sharp, Cell 59:789-795 
(1989)) or by inhibiting splicing (j. Kj ems et al . Cell 
67:169-178 (1991)) are not easily reconciled with the 
knowledge that all retroviruses produce structural 
proteins from mRNAs that contain unutilized splice sites. 
Spl 1C xng of all retroviral mRNAs, including HIV-l mRNAs in 
tne absence of Rev i=! inoff,-^^ 

„ , ' 13 lnef ficient compared to splicing of 

cellular mRNAs (J. Kjems et al., Cell 67:169-178 (1991) - 

LT'^r GSne DeV ' ^ :1158 - 11 ^ («90); R. K a ; 2 

and A. Skalka, Mol. Cell. Biol. 10:696-704 (1990)- C 
Stoltzfus and S. Fogarty, j. virol. 63:1669-1676 '(1989)) 
The majority of the retroviruses do not produce Rev-like' 
proteins, yet they efficiently express proteins from 
partially spliced mRNAs, suggesting that inhibition of 
expression by unutilized splice sites is not a general 
property of retroviruses. Experiments using constructs 
expressing mutated HIV-l gag and env mRNAs lacking 
functional splice sites showed that only low levels of 
these mRNAs accumulated in the absence of Rev and that 
their expression was Rev-dependent (M. Emerman et al 
Cell 52:1155-1165 (1989); B. Pelber et al., p roc . Na^ . 
Acad. SCI. USA 86:1495-1499 (1989); M. Malim et al 
Nature (London) 331:254-257 (1989)). This led to the 
conclusion that Rev acts independently of splicing ( B 
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Felber et al., Proc. Natl. Acad. Sci. USA 86:1495-1499 
(1989); M. Malim et al., Nature (London) 33.8:254-257 
(1989)) and to the proposal that inhibitory/ instability 
elements other than splice sites are present on HIV-1 
mRNAs (C. Rosen et al., Proc. Natl. Acad. Sci. USA 
85:2071-2075 (1988); M. Hadzopoulou-Cladaras , et al . , J. 
Virol.' 63:1265-1274 (1989); B. Felber et al., Proc. Natl. 
Acad. Sci. USA 86:1495-1499 (1989)). 

Construction of the Gag Expression Plasmids 

Plasmid p!7R has been described as pNL17R (S. 
Schwartz et al., J. Virol. £6:150-159 (1992)). Plasmid 
pl7 was generated from pl7R by digestion with restriction 
enzyme Asp718 followed by religation. This procedure 
deleted the RRE and HIV-1 sequences spanning nt 8021-8561 
upstream of the 3' LTR. To generate mutants of pl7 eag , the 
pl7 eag coding sequence was subcloned into a modified 
pBLUESCRIPT vector (Stratagene) and generated single 
stranded uracil -containing DNA. Site-directed mutagenesis 
was performed as described (T. Kunkel, Proc. Natl. Acad. 
Sci. USA 82:488-492 (1985); S. Schwartz et al . , Mol. Cell 
Biol. 12:207-219 (1992))." Clones containing the 
appropriate mutations were selected by sequencing of 
double -stranded DNA. To generate plasmid pl9R, plasmid 
pl7R was first digested with BssHII and EcoRI, thereby 
deleting the entire p!7 g8g coding sequence, six nucleotides 
upstream of the pl7 ga& AUG and nine nucleotides of linker 
sequences 3' of the pl7 gae stop codon. The pl7 eag coding 
sequence in pl7R was replaced by a PCR- amplified DNA 
fragment containing the RSV pl9 £ag coding sequence (R. 
Weiss et al . , RNA Tumor Viruses . Molecular Biology of 
Tumor Viruses (Cold Spring Harbor Laboratory, Cold Spring 
Harbor, New York, 1985) ) . This fragment contained eight 
nucleotides upstream of the RSV gag AUG and the pl9 Elg 
coding sequence immediately followed by a translational 
stop codon. The RSV gag fragment was derived form the 
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infectious RSV proviral clone S-ka (r . Weiss ec ^ 
J^or y 1T1 , w Mm naamn^ nf , ^ 

Spring Harbor Laboratorv mi* o "-oia 
198511 ,o aDOrator y' c ° ld Spring Harbor, New York, 

f ill \ "* deriVSd P19R by •» Asp 718 

f r agne nt containing the RRE and 3 ' HIV- x sequences 

spanning nt 8021-8561. 

HLtat cells (S. Schwartz et al., j. virol 

" ctreclpitat ■ (1 " 0) ' " Sre using the cai; ium 

Eb v T teCb ^ <»• Grah^ et al. and A. Van der 

«b. Virology 52=456-460 ,1973,, as described ,B. Pelber et 

*«• S ^ °SA M: i 49 5-1499 ,1939,, 

absent f , P1 ™ 1234 ' »»' « *» «- 

15 plasld OL I" P yr enCS <+) ° £ 2 " ° £ the ^--expressing 
USa L ,!« Felb " 6C " 1 - Acad, Sci 

USA .86: 1495-1499 (iqqq \ \ m u 

hranaf ,. (1989)). The total amount of DNA in 

transfectxons was adjusted to 17 Mg per 0.5 ml of 
precx pitat 60 „ plate us . ng puci9 carr±er 

20 exacts 6 " 20 h — s f — d and cell - 

extracts were subjected to electrophoresis on 12 5% 
denaturing polyacrylamide gels and analyzed by * 

xmmunoblotting using either human HIV-i Da tH^t- 
(d-H-,,,,, nj - v 1 patient serum 

(Scrxpps) or a rabbit anti-pig** serum m?w i, 
(J. de Wet et al Mo i cell «■ ? P^V-lucxf erase 

25 contains th* ,■ 1, ^ 2:725 " 737 ("87)) that 

contains the firefly luciferase gene linked to the RSV LTR 

Promoter, was used as an internal standard to conLi foT 

r: on : ff ; ciency and was de ::: b : 

f° l0min et al " J - Virol. £4:6010-6017 (1990)) The 
3q results are set forth in Fig. 2 . ^ The 

Northern ftnaly gi B 

and h, HL ! at 06113 W6re transfec ted as described above 
and harvested 20 h post tr= rofo ,, • e 

P Cra nsfection. Total RNA was 

» L^l "Hi": h ~^— «■ Krawc^ aid c. 

wu, Anal. Biochem. 165:20-27 noflin ^ ^« 

^/ (1987)), and 20 /zg of total 
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RNA was subjected to northern blot analysis as described 
(M. Hadzopoulou-Cladaras et al., J. Virol. 63 : 1265-1274 
(1989)) . The filters were hybridized to a nick- translated 
PCR- amplified DNA fragment spanning nt 8304-9008 in the 
HIV-1 3' LTR. The results are set forth in Fig. 3. 

5 

EXAMPLE 2 
HIV-1 ENV GENE 
Fragments of the env gene were inserted into 
vectors pl9 or p37M1234 and the expression of the 

10 resulting plasmids were analyzed by transf ections into 
HLtat cells . It was found that several fragments 
inhibited protein expression. One of the strong INS 
identified was in the fragment containing nucleotides 
8206-8561 ("fragment [8206-8561]"). To eliminate this 

15 INS, the following oligonucleotides were synthesized and 
used in mutagenesis experiments as specified supra . The 
fragment was derived from the molecular clone pNL43 r which 
is almost identical to HXB2 . The numbering system used 
herein follows the numbering of molecular clone HXB2 

20 throughout- The synthesized oligonucleotides follow the 
pNL43 sequence . 

The oligonucleotides which were used to 
mutagenize fragment [8206-8561] , and which made changes in 
the env coding region between nucleotides 8210-8555 (the 

25 letters in lower case indicate mutated nucleotides) were: 



#1: 

8194-8261 

GAATAGTGCTGTTAACcTcCTgAAcGCtACcGCtATcGCcGTgGCgGAaGGaACcGAc 
30 AGGGTTATAG (SEQ ID NO: 10) 

#2 

8262-8323 

AAGTATTACAAGCcGCcTAccGcGCcATcaGaCAtATcCCccGccGcATccGcCAGGG 
35 CTTG (SEQ ID NO: 11) 
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#3 

8335-8392 
#4 

8393-8450 




(SEQ ID NO: 13) — cGCcGTATCTCGAGAC 

#5 

8451-8512 

ccT™^r cGccATcAccccctct ^ ccGccGccAA ^ c ^™^ 

CCTG (SEQ ID NO: 14) 
#6 

8513-8572 

GCTAGAAGCACAgGAaGAaGAgGAaGTcGGcTTcCCcGTtACcCCTCAGGTACCTTTA 
AG (SEQ ID NO: 15) A 

20 _ 

The expression of env was increased by the - 
elimination of the INS in fragment [8206-8561] as 
determined by analysis of both mRNA and protein. 

T ° fUrthSr ch a^cterize in detail the INS in 

25 ! I! SnV ' ^ C ° ding regi ° n ° f env — divided into 
different fragments, which were produced by PGR using 

appropriate synthetic oligonucleotides, and cloned in 

by additional mutagenesis as described above. After 

30 nTahTT 11 int ° hUman C£llS ' VSCt0r P37M1 - 10D P«-»ce. 

Tit 3 P37SV Pr0te±n - toy Str ° ng INS will 
inhibit the expression of gag if ligated in the same 

vector. The summary of the env fragments used is shown in 
F gure 11. The results Qf ^ experiments ^ 

35 11" SXiSt mUltiplS re ^ ions inhibiting 

expression m HIV-l env, and combinations of such regions 
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result in additive or synergistic inhibition- For 
example, while fragments 1, 2, or 3 individually inhibit 
expression by 2-6 fold, the combination of these fragments 
inhibits expression by 30 fold. Based on these results, 
additional mutant oligonucleotides have been synthesized 
5 for the correction of env INS . These oligonucleotides 
have been introduced in the expression vectors for HIV-l 
env pl20pA and pl20R270 (see Fig. 7) for the development 
of Rev- independent HIV-l env expression plasmids as 
discussed in detail below. 

0 

l. The mRNAs for gpl60 and for the 

extracellular domain (gp!2 0) are defective 
and their expression depends on the 
presence of RRE in cis and Rev in trans 



1.1 Positive and Negative Determinants for 
env mRNA Expression of HIV 



20 



25 



30 



35 



Previous experiments on the identification and 
characterization of the env expressing cDNAs had 
demonstrated that Env is produced from mRNAs that contain 
exon 4AE, 4BE, or 5E. (Schwartz et al., J- Virol. 
64:5448-5456 (1990); Schwartz et al., Mol . Cell. Biol. 
12:207-219 (1992) . All constructs generated to study the 
determinants of env expression are derived from pNL15E. 
This plasmid contains the HIV-l LTR promoter, the complete 
env cDNA 15E, and the HIV 3' LTR including the 
polyadenylation signal (Schwartz, et al . J. Virol. 
64:5448-5456 (1990) (Fig. 7) . pNL15E was generated from 
the molecular clone pNL4-3 (pNL4-3 is identical to pNL43 
herein) (Adachi et al., J. Virol. 59:284-291 (1986) and 
lacks the splice acceptor site for exon 6D, which was used 
to generate the tev mRNA (Benko et al., J. Virol. 64:2505- 
2518 (1990) . The Env expression plasmids were transfected 
in the presence or absence of the Rev- expressing plasmid 
pL3crev (Felber et al., J\ Virol. 64:3734-3741 (1990) into 
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HLtat cells (Schwartz et al., j. virol. 64:2519-2529 
(1990), which constitutively express Tat (one-exon Tat) 
One day later, the cells were harvested for analyses of" 
RNA and protein. Total RNA was extracted and analyzed on 
Northern blots. Protein production was measured by 
Western blots to detect cell -associated Env. m the 
absence of Rev, NL15E mRNA was efficiently spliced and 
produced Nef ; in the presence of Rev, most of the RNA 
remained unspliced and produces the Env precursor gpiso 
which is processed to gpi20 f the secreted portion of the 
precursor and gp4i. 

To allow for the effects of INS to be 
distinguished and studied separately from splicing, splice 
sites known to exist within some of the fragments used 
were eliminated as discussed below. Analysis of the 
resulting expression vectors included size determination 
of the produced mRNA, providing the verification that 
splicing does not interf^-r^ wit-v. *-v^ • _ 

. terrere Wlth th e interpretation of the 

data. 

1.2 Env expression is Rev-dependent also 
in the absence of functional splice 
sites ^ 

To study the effect of splicing on env 
expression, the splice donor at nt 5592 was removed by 
25 site-directed mutagenesis (changing GCAGTA to GaAtTc, and 

TeI^^ an ECORI site) - which resuited in 

15ESD- (Fig. 7) . The mRNA from this construct was 
efficiently spliced and produced a small ^ encoding Nef 
(Fig. 8). Sequence analysis revealed that this spliced 
mRNA was generated by the use of an alternative splice 
donor located at nt 5605 (TACATgtaatg) and the common 
splice acceptor site at nt 7925. m contrast to published 

MoL ^ Pr ° C - ACad - SGi - US * 87:7598-7602 

« I I eXPreSSi ° n ° f Env fro * this mutant depended on 
35 Rev. Next, the splice accenfnr = 

y ice acce Ptor site was mutated at nt 

. Since previous cDNA cloning had revealed that in 
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addition to the splice acceptor site at nt 7925 there are 
two additional splice acceptor sites at nt 7897 and nt 
7901 (Schwartz, et al. J. Virol. 64:2519-2529 (1990), this 
region of 43 bp encompassing nt 7884 to nt 7926 was 
removed. This resulted in plSEDSS (Fig. 7) . Northern 
5 blot analysis of mRNA from HLtat cells transfected with 
this construct confirmed that the 15EDSS mRNA is not 
spliced (Fig. 8B) . Although all functional splice sites 
have been removed from plSEDSS , Rev is still required for 
Env production (Fig. 8A) . Taken together with data 

10 obtained by studying gag expression, these results suggest 
that the presence of inefficiently used splice sites is 
not the primary determinant for Rev- dependent Env 
expression. It is known that at least two unused splice 
sites are present in this mRNA (the alternative splice 

15 donor at nt 5605 and the splice donor of exon 6D at nt 
6269) . Therefore, it cannot be ruled out that initial 
spliceosome formation can occur, which does not lead to 
* the execution of splicing. It is possible that this is 
sufficient to retain the mRNA in the nucleus and, since no 

20 splicing occurs, that this would lead to degradation of 
the mRNA. Alternatively, it is possible that 
splice- site- independent RNA elements similar to those 
identified within the gag/pol region (INS) are responsible 
for the Rev dependency (Schwartz et al., J. Virol. 

25 66:7176-7182 (1992); Schwartz et al., J. Virol. 66:150- 
159 (1992) . 

1.3 Identification of negative elements 
within gpl20 mRNA 

30 To distinguish between these possibilities, a 

series of constructs were designed that allowed the 
determination of the location of such INS elements. 
First, a stop codon followed by the restriction sites for 
Nrul and Mlul was introduced at the cleavage site between 

35 the extracellular gpl20 and the transmembrane protein gp4l 
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at nt 7301 in plasmid NL15EDSS result-™ - ™ 

7 > T oa ' resulting m pl20DSS (Fia 

7). Immunoprecipitation of cmi20 fmm i-h 

_ ,, . gp-L^u from the medium of cells 

transfected with m^nnca iiS 
levp , , pl20DSS confirmed the production of high 

levels of gp 12 o only in the presence of Rev (P ig 9B , 

The release of gpi 2 o is very efficient- «• , ' 

detectahin efficient, since only barely 

detectable amounts remain associated with the cells (daJ 
not h , . This f ±nding ^ ^ * ~ * - - 

the translation of the gp 4 i n or tion ^ „>, 

= sf*J- portion of the env cDNA i<? 

responsive for the defect in e „v expression. Next he 

121V ° £ St ° P COd0 ° ° £ 911120 '—is t in g * gp 41 
including the RRE and 3- LTR) with the SV40 

polyadenylation signal ,F tg . 7) „ as replaced . This 
construct. pa20 pA, produced very low levels of gp 120 i„ 
the absence of Rev , Fig . 9B) . Background level f of £ 
were produced from pl20DR (Fi „ 7) „„. . 
from MSiMnoc v. ' ch ™ as generated 

trom PBS120DSS by removing the S- portion of gp 4 l 

These T 9 T " 3C " S200) »)- 

These results demonstrate the presence of a major INS-lite 

ITZl^ 35120 P ° rti0n - T ° St ^ ^ o 

et al. f J. Virol. 64:6010-6017 (1990) 
were inserted into pi 20 pA downstream of the gpi 2 o stop 

resp^ti::: 1 ^! 5 in pi2 ° R33 °< — 70 , and 

ZlTllTl 9 ' ^"itations demonstrated 

that the presence of Rev in trans and the rre in cis could 

:t*:t in the 915120 — * ™ 

levels of gp 120 were produced from pi 20 R330 (data not 

shown), pl 20 R 2 70, andpl 20 RD345 (Fia 9B ) in ^ 

of Rev i*ig. 9B) m the presence 



protein dalTT <Pi9 ' 8A » ""firmed the 

protein data. The Drp^onpo « 

itie presence of Rev resulted in the 

H^ZTV hl9h XeVSlS ° f * PBS120DSS, 

were Lodu^ f ^ ^ ^ °* -» 

were produced from pl20DpA and P 120DR. 



35 
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2. Identification of INS elements located 
within the env mRNA regions using two 
strategies 

To identify elements that have a down regulatory 
effect in vivo, fragments of env cDNA were inserted into 
5 two different test expression vectors, p!9 and p37Ml-10D. 
These vectors contain a strong promoter for rapid 
detection of the gene product, such as the HIV-1 LTR in 
the presence of Tat, and an indicator gene that is 
expressed at high levels and can easily be assayed such as 

10 pi9 8a * of RSV or the mutated p37 ga * gene of HIV-1 

(p37Ml-10D) , neither of which contains any known INS -like 
elements. Expression vector pl9 contains the HIV-1 LTR 
promoter, the RSV p!9 gag matrix gene, and HIV-1 sequences 
starting at Kpnl (nt 8561) including the complete 3' LTR 

15 (Schwartz, et al., J. Virol. 66:7176-7182 (1992). Upon 
transfection into HLtat cells high levels of pl9gag are 
constitutively produced and are visualized on Western 
blots. Expression vector p37Ml-10D contains the HIV-1 LTR 
promoter, the mutant p37gag (Ml-10) , and the 3' portion of 

20 the virus starting at Kpnl (nt 8561) . Upon transfection 
into HLtat cells this plasmid constitutively produces 
p37 gag that can be quantitated by the HIV-l p24 gag antigen 
capture assay. 

25 2.1 Identification of INS elements using 

the RSV aaa expression vector 

INS elements within the gp41 and gpl20 portions 
were identified. To this end, the vector pl9 was used and 
the following fragments (Fig. 10) were inserted: (A) nt 

30 7684 to 7959; (B) nt 7684 to 7884 and nt 7927 to 7959; 

this is similar to fragment A but has the region of the 
splice acceptors 7A, 7B and 7 deleted; (C) nt 7595 to 7884 
and nt 7927 to 7959, having the splice sites deleted as in 
B; (D) nt 7939 to 8066; (E) nt 7939 to 8416; (F) nt 8200 

35 to 8561 (Hpal-Kpnl) ; (G) nt 7266 to 7595 containing the 
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10 



15 



intact RRE; (H) nt 55? ^ t-o eion 

„ a i , , ' haVing the s P lice donor 

oD5 deleted. 



Fragments A, B, and D did not affect Gag 
expression, whereas fragment G (RRE) decreased gag 
expression approximately 5x. Fragment C, E. and H lowered 
Gag express^ by about 10-20-fold indicating the presence 
of INS elements. 

Interestingly, it was observed that the 

insertion of element F spanning 350 bp in plasmid pi 9 

abolished production of Gag, indicating t-h Q 

= ' J-ncii eating the presence of a 

strong INS within this element Tho w 

element. The presence of the RRE 
m cis and Rev in trans rpcuit-o^ -i 

1 , crans resulted m production of high 

levels of rsv pis-. Fragm ent P also had a smaller 
downregulatory effect on the expression of the 
INS-corrected P 17^ of HIV-i ( P 17M1234) . These 
experiments revealed the presence of multiple elements 
located within the env mRNA that cause inhibition of pi 9 - 
expression. 

2.2 Elimination of the INS within 
20 fragment- g UI1X " 

Six synthetic oligonucleotides (Table 3) were 
generated that introduced 103 point mutations within this 
region of 330 nt without affecting the amino acid 

composition of Env Th<* n„,t^ 0 j c 
25 _ g hrt #4S The mutat ed fragment F was tested in 

P19 to verify that the INS elements are destroyed. The 
introduction of the mutations within oligo#i only 
marginally affected the expression of pl9 ~ whereas the 

thTlNS 6 t aU ° ligOS (#1 t0 #6> c ^«fly inactivated 
the INS effect of fraoment- v 
30 a „ _ fragment F. This is another example 

that more than one region within an tmo -> 
K , _ . a wicnm an INS element needed to 

by mutagnemzed to eliminate the INS effect. 

It is noteworthy that this INS element is 
present m all the multiply spliced Ra v ^ 
snrh =» e P cea Rev " independent mRNAs , 

35 2f .1 ^ * Ex P erime -t:s were performed to 

define the function of fragment F within the class of the 
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small mRNAs by removing this fragment from the tat cDNA. 
In the context of this mRNA, this element confers only a 
weak INS effect (3-5- fold inhibition), which suggests 
that inhibition of expression in env mRNA may require the 
presence of at least two distinct elements. These results 
suggested that the INS effect within env is based on 
multiple interacting components. Alternatively, the 
relative location and interactions among multiple INS 
components may be important for the magnitude of the INS 
effect. Therefore, more than one type of analysis in 
different vectors may be necessary for the identification 
and elimination of INS. 

2.3. Identification of INS elements using 
P37M1-10D expression vector 

The env coding region was subdivided into 
different consecutive fragments. These fragments and 
combinations of thereof were PCR-amplif ied using oligos as 
indicated in Fig. 11 and inserted downstream of the 
mutated p37^ gene in p37Ml-10D. The plasmids were 
transfected into HLtat cells that were harvested the next 
day and analyzed for p24 gas expression. Fig. 11 shows that 
the presence of fragments 2, 3, 5 as well as the 
combination 1+2+3 lowered gag expression substantially. 
Different oligos (Table 4) were synthesized that change 
the AT- rich domains including the three AATAAA elements 
located within the env coding region by changing the 
nucleotide but not the amino acid composition of Env. In 
a first approach, these oligos 1-19 are being introduced 
into plasmid pl20R270 with the goal or producing gpl20 in 
a Rev- independent manner. Oligonucleotides such as oligos 
20-26 will then be introduced into the gp41 portion, the 
two env portions combined and the complete gpl60 expressed 
in a Rev- independent manner. 

35 
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PROTO- ONCOGENE C-FOS 



Fragments of the fos 



gene were inserted into the 



vector pi9 and the expression of h ha , • 

Mprp aTla . „ w *pression of the resulting plasmids 



were analyzed by transfect 



found that several fragments inhib 



ions into HLtat cells, it 



was 



ited protein expression. 



A strong ins was identified -in * 

nurlen ,, ( , „ identified m the fragment containing 
nucleotides 3328-3450 ("fragment [3328- 



(nucleotides of the fos gene are numbered 
Genebank sequence entry HUMCFOT 
addition, a weaker element 



3450] ") 



according to 



ACCESSION # V01512 



) 



In 



region 



was identified in the coding 

To eliminate these INS the following 

ized and are used in 



oligonucleotides were synthes 
mutagenesis experiments as specif ied"supra 

To eliminate the INS in the fos non- coding 
region, the following oligonucleotides 
in the fos non-cnrfinrr 



in the f 

3450] (the letters 
nucleotides), were synthes 



non 
which 



make changes 



[3328- 



sujora: 



non- coding region between nucleotides 
in lower case indicate mutated 

ized and are used to mutagenize 

s as specified 



fragment [3328-3450] , mutagenesis experiment 



#1: 

3349-3391 

M TGAAAACGTTcgcaTGTGTcgcTAcgTTgcTTAcTAAGATGGA , SEQ ID N0 



30 



#2: 

3392-3434 



TTCTCAGATAc cTAgcTTcaTATTgc cTTaTTgTCTACCTTGA 



(SEQ ID NO: 



" P^Tor JZlllTZ c T rted into vectors 919 ■ 

PJ7M1234 and the expression o£ Che resulting 
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plasmids are analyzed after transfection into HLtat cells. 

The expression of fos is expected to be 
increased by the elimination of this INS region. 

To further define and eliminate the INS elements 
in the coding region, additional longer fragments of fos 
5 are introduced into vector p37M1234. The INS element in ' 
the coding region is first mapped more precisely using 
this expression vector and is then corrected using the 
following oligonucleotides: 

10 #1 

2721-2770 

GCCCTGTGAGtaGGCActGAAGGacAGcCAtaCGtaACatACAAGTGCCA (SEQ ID 
NO: 18) 

15 #2 

2670-2720 

AGCAGCAGCAATGAaCCTagtagcGAtagcCTgAGtagcCCt ACGCTGCTG (SEQ 
ID NO: 19) 

20 #3 

2620-2669 

ACCCCGAGGCaGAtagCTTtCCatCcTGcGCtGCcGCtCACCGCAAGGGC (SEQ ID 
NO: 20) 

25 #4 

2502-2562 

CTGCACAGTGGaagCCTcGGaATGGGcCCtATGGCtACcGAatTGGAaCCaCTGTGCA 
CTC (SEQ ID NO: 21) 

30 The expression of fos is expected to be 

increased by the elimination of this INS region. 
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EXAMPLE A 
HIV-1 POL GENE 
Vector p37Ml234 was used to eliminate an 
mnibitory/instabilitv semipnro 

wh-ioh v, ^ w sequence from the pol gene of HIV-i 

which had been characterized by AW Cochrane et al 
"Identification and characterization of intragenic' 
sequences which repress human immunodeficiency virus 
structural gene expression", j. virol . 65:5305-5313 
(1991) . These investigators suggested that a region in 

pol (HIV nucleotides 3792-4052), termed CRS ■ 
for -inh-iK-i*-- termed CRS, was important 

for inhibition. A larger fragment spanning this region 
which container nucleotides 3700-4194. was inserted into 
the vector P 37 M l 23 4 and its effeccs OR ^ 

TZTl raSUlCin S Plasmid (plasmid P37M1234RCRS) 

15 ce!ls analyZed « ft « "ansfection into HLtat 

Severe inhibition of gag expression (10 fold 
see Pig. 13) „as observed. 

In an effort to eliminate this INS, the 
following oligonucleotides were synthesized (the letters 
in lower case indicated mutated nucleotides) and used in 
mutagenesis experiments. 

instate i,- Pi T' 1C ^ ° bServed that °™ AUUUA potential 
instability element was within the INS region. This was . 

rlZZT by , mUta9eneSis oligonucleotide MIOpol and 

resulted in plasmid P37M1234RCRSP10 . The expression of 
gag from this plasmid was not improved, demonstrating that 
elimination of the AUUUA element alone did not eliminate 

cne INS. See Fia n ~ 

S^fi Fig. 12. Therefore, additional mutagenesis- 
was performed and it was shown fh^ ^. g : 

was snown that a combination of 

mutations introduced in plasmid P 37M1234RCRS was necessary 
an sufficient to produce high levels of gag proteins 7 
which were similar to the plasmid lacking CRS. The 

mutations necessary for the elimination of the INS are \ 
shown in Fig. 13. \ 



The above results demonstrate that HIV-i pol 



I 
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contains INS elements that can be detected and eliminated 
with the techniques described. 

These results also suggest that regions outside 
of the minimal inhibitory region in CRS as defined by A.W. 
Cochrane et al • , supra , influence the levels of 
expression. These results suggest that the RNA structure 
of the region is important for the inhibition of 
expression. 
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Table 1 

Correspondence between Sequence 
Identification Numbers and Nucleotides in Figure 4 



Seouence ! 


ID Nos . 


Ficrure 4 












SEQ 


ID 


NO 


:1 


nucleotides 


336 


-731 








SEQ 


ID 


NO 


:2 


nucleotides 


402 


-452 








SEQ 


ID 


NO 


:3 


nucleotides 


536 


-583, 


above 


line 




SEQ 


ID 


NO 


:4 


nucleotides 


585 


-634, 


above 


line 




SEQ 


ID 


NO 


:5 


nucleotides 


654 


-703, 


above 


line 




SEQ 


ID 


NO 


:6 


nucleotides 


402 


-452, 


below 


line 


(Ml) 


SEQ 


ID 


NO 


: 7 


nucleotides 


536 


-583, 


below 


line 


(M2) 


SEQ 


ID 


NO 


:8 


nucleotides 


585 


-634, 


below 


line 


(M3) 


SEQ 


ID 


NO 


:9 


nucleotides 


654 


-703, 


below 


line 


(M4) 



20 



25 



Table 2 

Synthetic oligonucleotides used 
in the mutagenesis of HIV- 1 crag and pol regions 

The upper sequence is the wild- type HIV-l as 
found in HIV^^ while the bottom is the mutant 
oligonucleotide sequence. The location of the sequence is 
indicated in parentheses . 



30 



M5gag (778-824) 

CACCTAGAACTTTAAATGCATGGGTAAAAGTAGTAGAAGAGAAGGCT (SEQ ID 
NO: 22) 

XX X X X XXX 

CACCTAGAACccTgAAcGCcTGGGTgAAgGTgGTAGAAGAGAAGGCT (SEQ ID 
NO: 23) 
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M6gag (871-915) 

C^CCCCACAAGATTTAAACACCATGCTAAACACAGTGGGGGGAC (SEQ ID NO 

X XX X XXX X 

CCACCCCACAgGAccTgAACACgATGtTgAACACcGTGGGGGGAC (SEQ ID NO 

M7gag (1105-1139) 
CAGTAGGAGAAATTTATAAAAGATGGATAATCCTG (SEQ ID NO: 26) 

CAGTAGGAGAgATcTAcAAGAGgTGGATAATCCTG (SEQ ID NO: 27) 

10 M8gag (1140-1175) 

GGATTAAATAAAATAGTAAGAATGTATAGCCCTACC (SEQ ID NO: 28) 

GGATTgAAcAAgATcGTgAGgATGTATAGCCCTACC (SEQ ID NO: 29) 

M9gag (1228-1268) 
^ ACCGGTTCTATAAAACTCTAAGAGCCGAGCAAGCTTCACAG (SEQ ID NO: 30) 

ACCGGTTCTAcAAgACcCTgcGgGCtGAGCAAGCTTCACAG (SEQ ID NO: 31) 

MIOgag (1321-1364) 
ATTGTAAGACT ATTTTAA. 

32) J-f^-^w^,.,. i uuvrft^LHULWL'TACACTA (SEQ ID NO: 

20 xxxxxxxxx 

ATTGTAAGAC c ATc CTgAAgGC t CTcGGc CCAGCGGCTACACTA (SEQ ID NO: 
Mllgag (1416-1466) 

MAGTOT^CTGAAGCMTGAGCCAAGT^CMATTCAGCTACCATAATG , SEQ 

25 x x x 

AGAG^CcGAgGCgATGAGCCAgGTgACgAA^TCgGCgACCATAATC (SEQ 




30 



Ml2gag (1470-1520) 

^t^^^ C ^ GA ^™TTAAGTGTTTCAAT TCT ,SEQ 

x XX XX X XX 

S^of??f C ^^ (SEQ 
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M13gag (1527-1574) 

GAAGGGCACACAGCCAGAAATTGC^GGGCCCCTAGGAAAAAGGGCTGT (SEQ ID 
NO: 38) 

XXX XX X 

GAAGGGCACACcGCCAGgAAcTGCcGGGCCCCccGGAAgAAGGGCTGT (SEQ ID 
NO: 39) 



M14gag (1581-1631) 

TGTGGAAAGGAAGGACACCAAATGAAAGATTGTACTGAGAGACAGGCTAAT ( SEQ 
ID NO: 40) 

XX X XXXXXX 

TGTGGAAAGGAgGGgCACCAgATGAAgGAcTGcACgGAGcGgCAGGCTAAT ( SEQ 
ID NO: 41) 

MOpol (1823-1879) (K to R difference introduced) 

CCCCTCGTCACAATAAAGATAGGGGGGCAACTAAAGGAAGCTCTATTAGATACAGGAG 
(SEQ ID NO: 42) 

XXX X X XX X 

CCCCTCGTCACAgTAAgGATcGGGGGGCAACTcAAGGAAGCgCTgcTcGATACAGGAG 
(SEQ ID NO: 43) 



Mlpol (1936-1987) 

GATAGGGGGAATTGGAGGTTTTATCAAAGTAAGACAGTATGATCAGATACTC ( S EQ 
ID NO: 44) 

XXXXX XXX XX 

GATAGGGGGgATcGGgGGcTTcATCAAgGTgAGgCAGTAcGAcCAGATACTC ( S EQ 
ID NO: 45) 



M2pol (2105-2152) 

CCTATTGAGACTGTACCAGTAAAATTAAAGCCAGGAATGGATGGCCCA (SEQ ID 
NO: 46) 

XXXXXX XX 

CCTATTGAGACgGTgCCcGTgAAgTTgAAGCCgGGgATGGATGGCCCA (SEQ ID 
25 NO: 47) 



M3.2pol (2162-2216) 

CAATGGCCATTGACAGAAGAAAAAATAAAAGCATTAGTAGAAATTTGTACAGAGA 
(SEQ ID NO: 48) 

X XXXXX X X 

CAATGGCCATTGACgGAAGAgAAgATcAAgGCcTTAGTcGAAATcTGTACAGAGA 
(SEQ ID NO: 49) 
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M4pol (2465-2515) 

S C N^r TACTGC ^ (SEQ 
M5pol (2873-2921) 

^AGTC^AAATTGAATTC^GCAAGTCAGATTTACCCAG^ATTAAAG (SEQ ID 

XX X XX X x 

TTAGTGGGGAAggTGAAcTGGGCgAGcCAGATcTACCCgGGGATTAAAG (SEQ ID 

M6pol (3098-3150) 

^SffS^ ATATC ^ TTTATC ^ G AGCCATTTAAAAATCTGAAAACAGG ( SEQ 
^S^^f^ ( SEQ 

M7pol (3242-3290) 

TG^AAAGACTCCTAAATTTAAACTGCCCATACAAAAGGAAACATGGG (SEQ ID 

^ X X X X 

TGGGGAAAGACgCCgAAgTT^ (SEQ ID 

M8pol (3520-3569) 

GAAGACTGAGTTACAAGCAAT^^^ (SEQ ID 

xxxxxxxxx x 

GAAGACTGAGcTgCAgGCgATcTAcCTgGCgcTGCAGGAcTCGGGATTAG (SEQ ID 



M8.2pol (3643-3698) 
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M9pol (3749-3800) 

GTCAGTGCTGGAATCAGGAAAGTACTATTTTTAGATGGAATAGATAAGGCCC ( S EO 
ID NO: 62) 

XX XX XXXXXX 
ID^O TG 63? GgATCC ^^ 9GTgCTATTCCT9GACGGgATCGM ( SEQ 

M9.2pol (3806-3863) 

GAACATGAGAAATATCACAGTAATTGGAGAGCAATGGCTAGTGATTTTAACCTGCCAC 
(SEQ ID NO: 64) 

X X XXX X XXX XXXX 

GAACATGAGAAgTAcCACtccAAcTGGcGcGCtATGGCcAGcGAcTTcAACCTGCCAC 
(SEQ ID NO: 65) 

MIOpol (3950-4001) 

GGAATATGGCAACTAGATTGTACACATTTAGAAGGAAAAGTTATCCTGGTAG ( SEO 
ID NO: 66) v 

XXXXXXXXXXXX 

GGAATATGGCAgCTgGAcTGcACgCAccTgGAgGGgAAgGTgATCCTGGTAG ( S EQ 
ID NO: 67) 



Mllpol (4031-4096) 

GCAGAAGTTATTCCAGCAGAAACAGGGCAGGAAACAGCATA 
-CAGGAAGA (SEQ ID NO: 68) 

XXX X XXXXXXXX XX 

GCAGAAGTTATcCCtGCtGAAACtGGGCAGGAgACcGCcTAcTTcCTgcTcAAAcTcG 
-CAGGAAGA (SEQ ID NO: 69) 



M12pol (4097-4151) 

TGGCCAGTAAAAACAATACATACTGACAATGGCAGCAATTTCACCGGTGCTACGG 
(SEQ ID NO: 70) 

XX XXXX XX X X 

TGGCCAGTgAAgACgATcCAcACgGACAAcGGaAGCAAcTTCACtGGTGCTACGG 
25 (SEQ ID NO: 71) 

M13pol (4220-4271) 

GGAGTAGTAGAATCTATGAATAAAGAATTAAAGAAAATTATAGGACAGGTAA (SEO 
ID NO: 72) 

X XX XX XXX 

GGAGTAGTAGAATCcATGAAc AAgGAAcTgAAGAAgATc ATcGGACAGGTAA (SEO 
ID NO: 73) 



M12pol-p (4097-4151) (indicates the sequence found in 
p3 7M12 3 4RCRSP10+P12p 

TGGCCAGTAAAAACAATACAcACgGACAAcGGaAGCAAcTTCACtGGTGCTACGG 
35 (SEQ ID NO: 74) 
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Table 3 



t-n f??^ nC !: S °£ mutant oligos designed 
to eliminate tho TNS eff^r-t- ^ , r p 

The six oligonucleotides used to eliminate the 
5 INS effect of fragment F (oligos #1 to #6) are set forth 
above in Example 2 (SEQ. ID. NOS. 10-15). 

Table 4 

10 de S tro? e ?SS nC f ° f mutant oligos designed to 

destroy TNS elements wirhin th* am, *„, lno rpHnn 

The wildtype (top) and the mutant oligo (below) 
of 26 different regions are shown, 
mutant o ligos for env of htv-i - 
15 Ml (5834-5878) 46-mer 

CTTGGGATGTTGATGATCTGTAGTGCTACAGAAAAATTGTGGGTC (SEQ ID NO: 

CTTGGGATGCTGATGA^ (gEQ ^ ^ 



M2 (5886-5908) 24-mer 

ATTATGGGGTACCTGTGTGGAAG (SEQ ID NO: 77) 

ATTATGGcGTgCCcGTGTGGAAG (SEQ ID NO: 7 8 ) 
M3 (5920-5956) 38-mer 

CACTCTATTTTGTGC^TCAGATCCTAAAGCATATGAT (SEQ ID NO: 79) 

CACTCTATTcTGcGCcTCcGAcGCcAAgGCATATGAT (SEQ ID NO: 80) 
M4 (5957-5982) 27-mer 

ACAGAGGTACATAATGTTTGGGCCAC (SEQ ID NO: 81) 
ACAGAGGTgCAcAAcGTcTGGGCCAC (SEQ ID NO: 82) 
M5 (6006-6057) 53-mer 

£*SS^ (SEQ 
x X X X XX " " 



A A A XX XX XYY-w 

^f C 8" 9GA9GT9GT3C ^ T3MCGTGA " G 4AAcTTcAACA TOTC (SEQ 
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M6 (6135-6179) 46-mer 

TAACCCCACTCTGTGTTAGTTTAAAGTGCACTGATTTGAAGAATG (SEQ ID NO- 
85) 

X X X XX X X XX 

TAACCCCcCTCTGcGTgAGccTgAAGTGCACcGAccTGAAGAATG (SEQ ID NO: 
86) 

5 M7 (6251-6280) 31-mer 

ATCAGCACAAGCATAAGAGGTAAGGTGCAG (SEQ ID NO: 87) 
X XX X X 

ATCAGCACcAGCATccGcGGcAAGGTGCAG (SEQ ID NO: 88) 
M8 (6284-6316) 34-mer 

GAATATGCATTTTTTTATAAACTTGATATAATA (SEQ ID NO: 89) 
X X X X X X 
10 GAATATGCcTTcTTcTAcAAgCTgGATATAATA (SEQ ID NO : 90) 

M9 (6317-6343) (28-mer) 

CCAATAGATAATGATACTACCAGCTAT (SEQ ID NO: 91) 
X XXX 

CCAATAGcTAAgGAcACcACCAGCTAT (SEQ ID NO : 92) 

15 M10 (6425-6 469) (46-mer) 

GCCCCGGCTGGTTTTGCGATTCTAAAATGTAATAATAAGACGTTC (SEQ ID NO: 
93) 

XXX XXXXXX 

GCCCCGGCcGGcTTcGCGATcCTgAAgTGcAAcAAcAAGACGTTC (SEQ ID NO: 
94) 

Mil (6542-6583) (42-mer) 

20 CAACTGCTGTTAAATGGCAGTCTAGCAGAAGAAGAGGTAGTA (SEQ ID NO: 95) 

XXX XXXXX 

CAACTGCTGcTgAAcGGCAGcCTgGCcGAgGAgGAGGTAGTA (SEQ ID NO: 96) 
M12 (6590-6624) (35-mer) 

TCTGTCAATTTCACGGACAATGCTAAAACCATAAT (SEQ ID NO: 97) 
X X XXX 

25 TCTGCCAAcTTCACcGACAAcGCcAAgACCATAAT (SEQ ID NO: 98) 

M13 (6632-6663) (32-mer) 

CTGAACACATCTGTAGAAATTAATTGTACAAG (SEQ ID NO: 99) 

XXXXXX 
CTGAACCAgTCcGTgGAgATcAAcTGTACAAG (SEQ ID NO: 100) 



30 



M14 (6667-6697) (31-mer) 

CAACAACAATACAAGAAAAAGAATCCGTATC (SEQ ID NO: 101) 

X X X XX X 

CAACAACAAcACcGGcAAgcGcATCCGTATC (SEQ ID NO: 102) 
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M15 (6806-6852) (47-mer) 

NSy^oT^^^^^^^^^^CTT (SEQ ID 

xxxxxxxxxxxxx 

NS! AG 04t gCT9CGCG ^ (SEQ ID 

5 M16 (nt 6917-6961) (45-mer) 

TOCTACTGTAATTCAACAO^CTGTTTAATAGTACTTGGTTTAAT (SEQ ID NO: 

^CTACTGgAAcTCcACcCA^ (SEQ ID NO: 



M17 (nt 7006-7048) (43-mer) 

CACAATCACCCTCCCATGCAGAATAAAACAAATTATAAACATG (SEQ ID NO: 
CACAATCACcCTg^ (seq ^ ^ 

M18 (nt 7084-7129) (46-mer) 

^TCAGTGGAC^TTAGATGTTCATCAAATATTACAGGGCTGCTA (SEQ ID NO: 

xxxxxxxxxxx 

CATCAGCGGcCAgATccGcTGcTCcTCcAAcATcACGGGGCTGCTA (SEQ ID NO: 

5?" i nt 7195-7252) (58-mer) 
20 ^ID^^ 

M20 (nt 7594-7633) (40-mer) 

GCCTTGGAATGCTAGTTGGAGTAATAAATCTCTGGAACAG (SEQ ID NO: 113) 

GCCTTGGAAcGCcAGcTGGAGcAAcAAgTCcCTGGAACAG (SEQ ID NO: 114) 
M21 (nt 7658-7689) (32-mer) 

GAGTGGGACAGAGAAATTAACAATTACACAAG (SEQ ID NO: 115) 
GAGTGGGACcGcGAgATcAACAAcTACACAAG (SEQ ID NO: 116) 
M22 (nt 7694-7741) (48-mer) 

ATACACTCCTTAATTGAAGAATCGCAAAAC^ (SEQ ^ 

ATACACTC^ (SEQ ID 



30 



35 



BMSOOCIO: <WO 932Q212A1 1 > 



4 



WO 93/20212 




PCT/US93/02908 



- 80 - 

M23 (nt 7954-7993) (40-mer) 

CAGGCCCGAAGGAATAGAAGAAGAAGGTGGAGAGAGAGAC (SEQ ID NO: 119) 

XXXXXXXX 

CAGGCCCGAgGGcATcGAgGAgGAgGGcGGcGAGAGAGAC (SEQ ID NO: 12 0) 
M24 (nt 8072-8121) (50-mer) 

TACCACCGCTTGAGAGACTTACTCTTGATTGTAACGAGGATTGTGGAACT (SEQ ID 
5 NO: 121) 

XXXXXX XX X 

TACCACCGCcTGcGcGACcTgCTCcTGATcGTgACGAGGATcGTGGAACT (SEQ ID 
NO : 122 ) 

M25 (nt 8136-8179) (44-mer) 

GGTGGGAAGCCCTGAAATATTGGTGGAATCTCCTACAGTATTGG (SEQ ID NO: 
123) 

10 X XX XX 

GGTGGGAgGCCCTCAAgTAcTGGTGGAAcCTCCTcCAGTATTGG (SEQ ID NO- 
124) 

M26 (nt 8180-8219) (40-mer) 

AGTCAGGAACTAAAGAATAGTGCTGTTAGCTTGCTCAATG (SEQ ID NO: 125) 
XX XXXXXX 

15 AGTCAGGAgCTgAAGAAcAGcGCcGTgAaCcTGCTCAATG (SEQ ID NO: 126) 



Comments : 

Although the vast majority of oligonucleotides 
follow the HXB2 sequence, some exceptions are noted: 

In oligo M15, nt 6807 follows the pNL43 
sequence. (Specifically, nt 6807 is C in NL43 but A in 
HBX2.) Oligo M26 has the nucleotide sequence derived from 
pNL43. 



EXAMPLE 5 

USE OF OR P3 7M1-10D OR P55M1-13P0 IN 
IMMUNOPROPHYLAXIS OR IMMUNOTHERAPY 

In postnatal gene therapy, new genetic 
information has been introduced into tissues by indirect 
means such as removing target cells from the body, 
infecting them with viral vectors carrying the new genetic 
information, and then reimplanting them into the body; or 
by direct means such as encapsulating formulations of DNA 
in liposomes; entrapping DNA in proteoliposomes containing 
viral envelope receptor proteins; calcium phosphate co- 
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precipitating DNA; and coupling DNA to a polylysine- 
glycoprotein carrier complex. m addition, in vivo 
infectivity of cloned viral DNA sequences after direct 
intrahepatic injection with or without formation of 
calcium phosphate coprecipitates has also been described 
mRNA sequences containing elements that enhance stability 
have also been shown to be efficiently translated in 
XenoEus laevis embryos, with the use of cationic lipid 
vesicles. See, e.g., j. A . Wolff, e t al., Science 
247:1465-1468 (1990) and references cited therein. 

Recently, it has also been shown that injection 
of pure RNA or DNA directly into skeletal muscle results 
m significant expression of genes within the muscle 
cells. J.A. Wolff, et al., Science 247:1465-1468 (1990) 
Forcing RNA or DNA introduced into muscle cells by other 
means such as by particle-acceleration (N. -s. Yang, et 
a1 ' Proc. Nan , Acad. Sri TTSA 87:9568-9572 (1990); S.R 
Williams et al., Proc. Natl Acad «h tto» 88:2726-2730 
(1991)) or by viral transduction should also allow the DNA 
or RNA to be stably maintained and expressed. In the 
experiments reported in Wolff et al., RNA or DNA vectors 
were used to express reporter genes in mouse skeletal 
muscle cells, specifically cells of the quadriceps 
muscles. Protein expression was readily detected and no 
special delivery system was required for these effects 
Polynucleotide expression was also obtained when the 
composition and volume of the injection fluid and. the 
method of injection were modified from the described 
protocol. For example, reporter enzyme activity was 
reported to have been observed with 10 to 100 M i of 
hypotonic, isotonic, and hypertonic sucrose solutions 
Opti-MEM, or sucrose solutions containing 2mM CaCl, and 
also to have been observed when the 10- to 100- /il 
injections were performed over 20 min. with a pump instead 
of within l min. 

Enzymatic activity from the protein encoded by 
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the reporter gene was also detected in abdominal muscle 
injected with the RNA or DNA vectors, indicating that 
other muscles can take up and express polynucleotides. 
Low amounts of reporter enzyme were also detected in other 
tissues (liver, spleen, skin, lung, brain and blood) 
injected with the RNA and DNA vectors. Intramuscularly 
injected plasmid DNA has also been demonstrated to be 
stably expressed in non-human primate muscle. S. Jiao et 
al., Hum. Gene Therapy 3:21-33 (1992). 

It has been proposed that the direct transfer of 
genes into human muscle in situ may have several potential 
clinical applications. Muscle is potentially a suitable 
tissue for the heterologous expression of a transgene that 
would modify disease states in which muscle is not 
primarily involved, in addition to those in which it is. 
15 For example, muscle tissue could be used for the 

heterologous expression of proteins that can immunize, be 
secreted in the blood, or clear a circulating toxic 
metabolite. The use of RNA and a tissue that can be 
repetitively accessed might be useful for a reversible 
type of gene transfer, administered much like conventional 
pharmaceutical treatments. See J. A. Wolff, et al., 
Science 247:1465-1468 (1990) and S. Jiao et al., Hum. Gene 
Therapy 3:21-33 (1992). 

It had been proposed by J. A. Wolff et al., 
supra , that the intracellular expression of genes encoding 
antigens might provide alternative approaches to vaccine 
development. This hypothesis has been supported by a 
recent report that plasmid DNA encoding influenza A 
nucleoprotein injected into the quadriceps of BALB/c mice 
resulted in the generation of influenza A nucleoprotein- 
specific cytotoxic T lymphocytes (CTLs) and protection 
from a subsequent challenge with a heterologous strain of 
influenza A virus, as measured by decreased viral lung 
titers, inhibition of mass loss, and increased survival. 
J. B. Ulmer et al.. Science 259:1745-1749 (1993). 
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Therefore, it appears that the direct injection 
of RNA or DNA vectors encoding the viral antigen can be 
used for endogenous expression of the antigen to generate 
the viral antigen for presentation to the immune system 
without the need for self -replicating agents or adjuvants, 
resulting in the generation of antigen- specif ic CTLs and 
protection from a subsequent challenge with a homologous 
or heterologous strain of virus. 

CTLs in both mice and humans are capable of 
recognizing epitopes derived from conserved internal viral 
proteins and are thought to be important in the immune 
response against viruses. By recognition of epitopes from 
conserved viral proteins, CTLs may provide cross -strain 
protection. CTLs specific for conserved viral antigens 
can respond to different strains of virus, in contrast to 
antibodies, which are generally strain- specif ic . 

Thus, direct injection of RNA or DNA encoding 
the viral antigen has the advantage of being without some 
of the limitations of direct peptide delivery or viral 
vectors. See. J. A. Ulmer et al . , supra, and the 
discussions and references therein). Furthermore, the 
generation of high- titer antibodies to expressed proteins 
after injection of DNA indicates that this may be a facile 
and effective means of making antibody- based vaccines 
targeted towards conserved or non- conserved antigens, 
either separately or in combination with CTL vaccines 
targeted towards conserved antigens. These may also be 
used with traditional peptide vaccines, for the generation 
of combination vaccines. Furthermore, because protein 
expression is maintained after DNA injection, the 
persistence of B and T cell memory may be enhanced, 
thereby engendering long-lived humoral and cell-mediated 
immunity. 



35 



BNSOOCID: <WO 9320212A1 I > 



10 



15 



20 



25 



30 




WO 93/20212 J^fc PCT/US93/02908 




- 84 - 



1. Vectors for the immunoprophylaxis or 

immuno therapy against HIV-1 

The mutated gag genomic sequences in vectors 
P37M1-10D or p55Ml-l3P0 (Fig. 6) will be inserted in 
expression vectors using a strong constitutive promoter 
such as CMV or RSV, or an inducible promoter such as 
HIV- 1 . 

The vector will be introduced into animals or 
humans in a pharmaceutical^ acceptable carrier using one 
of several techniques such as injection of DNA directly 
into human tissues; electroporation or transfection of the 
DNA into primary human cells in culture (ex vivo ) , 
selection of cells for desired properties and 
reintroduction of such cells into the body, (said 
selection can be for the successful homologous 
recombination of the incoming DNA to an appropriate 
preselected genomic region) ; generation of infectious 
particles containing the gag gene, infection of cells ex 
vivo and reintroduction of such cells into the body; or 
direct infection by said particles in vivo. 

Substantial levels of protein will be produced 
leading to an efficient stimulation of the immune system. 

In another embodiment of the invention, the 
described constructs will be modified to express mutated 
gag proteins that are unable to participate in virus 
particle formation. It is expected that such gag proteins 
will stimulate the immune system to the same extent as the 
wild- type gag protein, but be unable to contribute to 
increased HIV-1 production. This modification should 
result in safer vectors for immunotherapy and 
immunophrophylaxis . 
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EXAMPLE 6 



^Y TI ^ N r7r H ^ V " D l EXPRESSI0N USING TRANSDOMINANT 
(TD)- TD-GAG-TD REV OR TD GAG - PRO - TD REV GENES 

5 Direct injection of DNA or use of vectors other 

than retroviral vectors will allow the constitutive high 
level of trans -dominant gag (TDgag) in cells. in 
addition, the approach taken by B.K. Felber et al 
Science 239:184-187 (1988) will allow the generation of 
1Q retroviral vectors, e.g. mouse-derived retroviral vectors 
encoding HIV-i TDgag, which will not interfere with the 
xnfection of human cells by the retroviral vectors. In 
the approach of Felber, et al., supra, it was shown that 
fragments of the HIV-l LTR containing the promoter and 
i5 part of the polyA signal can be incorporated without 

detrimental effects within mouse retroviral vectors and 
remaxn transcriptionally silent. The presence of Tat 
protein stimulated transcription from the HIV-l LTR and 
resulted in the high level expression of genes linked to 

the HIV-l LTR. 

The generation of hybrid TDgag - TDRev or TDgag- 
pro-TDRev genes and the introduction of expression vectors 
xn human cells will allow the efficient production of two 
proteins that will inhibit HIV-l expression. The 

25 incorporation of two TD proteins in the same vector is 
expected to amplify the effects of each one on viral 
replication. The use of the HIV-l promoter in a matter 
sxmxlar to one described in B.K. Felber, et al., supra 
wxll allow high level gag and rev expression in infected 

3Q cells. In the absence of infection, expression will be 
substantially lower. Alternatively, the use of other 
strong promoters will allow the constitutive expression of 
such proteins. This approach could be highly beneficial 
because of the production of a highly immunogenic gag 

35 which is not able to participate in the production of 
xnfectious virus, but which, in fact, antagonizes such 
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production. This can be used as an efficient 
immuniproptiylactic or immunotherapeutic approach against 
AIDS. 

Examples of trans -dominant mutants are described 
in Trono et al., Cell 59:112-120 (1989). 

5 

1. Generation of constructs encoding 
transdominant aacr mutant proteins 

Gag mutant proteins that can act as trans - 
dominant mutants, as described, for example, in Trono et 
^0 al., supra, will be generated by modifying vector 

P37M1-10D or p55Ml-13P0 to produce transdominant gag 
proteins at high constitutive levels. 

The transdominant gag protein will stimulate the 
immune system and will inhibit the production of 
15 infectious virus, but will not contribute to the 
production of infectious virus. 

The added safety of this approach makes it more 
acceptable for human application. 

20 Those skilled in the art will recognize that any 

gene encoding a mRNA containing an inhibitory/instability 
sequence or sequences can be modified in accordance with 
the exemplified methods of this invention or their 
functional equivalents. 

25 Modifications of the above described modes for 

carrying out the invention that are obvious to those of 
skill in the fields of genetic engineering, protein 
chemistry, medicine, and related fields are intended to be 
within the scope of the following claims. 

30 Every reference cited hereinbefore is hereby 

incorporated by reference in its entirety. 
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WHAT IS CLAIMED Tfi - 

1. A method for reducing the effect of 
inhibitory/instability sequences within the coding region 
of a mRNA, said method comprising the steps of: 

(a) providing a gene which encodes said 
mRNA; 

(b) identifying the inhibitory/instability 
sequences within said gene which 
encode said inhibitory/ instability 
sequences within the coding region of 
said mRNA; 

(c) mutating said inhibitory/instability 
sequences within said gene by making 
multiple point mutations ; 

(d) transfecting said mutated gene into a 
cell ; 

(e) culturing said cell in a manner to 
cause expression of said mutated gene; 

(f) detecting the level of expression of 
said gene to determine whether the 
effect of said inhibitory/instability 
sequences within the coding region of 
the mRNA has been reduced. 

2. The method of Claim 1 further comprising 
the step of fusing said mutated gene to a reporter gene 
prior to said transfecting step and said detecting step is 
performed by detecting the level of expression of said 
reporter gene. 

3. The method of Claim 1 wherein step (b) 
further comprises the steps of 

(a) fusing said gene or fragments of said 
gene to a reporter gene to create a 
fused gene; 

(b) transfecting said fused gene into a 
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cell ; 

(c) culturing said cell in a manner to 
cause expression of said fused gene; 

(d) detecting the level of expression of 
said fused gene to determine whether 
the expression of said fused gene is 
reduced relative to the expression of 
said reporter gene. 
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4. The method of Claim 3 wherein step (a) 
comprises fusing said gene or fragments of said gene 3' to 
the stop codon of said reporter gene. 

5. The method of Claim 3 wherein step (a) 
comprises fusing said gene or fragments of said gene in 
frame with the 3' end of the coding region of said 
reporter gene. 
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6. The method of Claim 1 or 2 wherein said 
mutating step changes the codons such that the amino acid 
sequence encoded by the mRNA is unchanged. 

7. The method of Claim 6 wherein said 
inhibitory/ instability sequences are AT-rich and wherein 
said mutating step comprises substituting either G or C 
for either A or T and wherein the final nucleotide 
composition of said mutated inhibitory sequence is about 
50% A and T and about 50% G and C. 



o 



8. The method of Claim 6 wherein at least 75 
of the point mutations replace conserved nucleotides with 
non- conserved nucleotides. 



9. The method of Claim 6 wherein said mutating 
step comprises substituting less preferred codons with 
more preferred codons. 
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, 10 " ^ ° f Claim 1 ° r 2 wher ein said 

mRNA encodes the GAG protein of a p« v a* ^ 

^ ein or a R ev- dependent complex 

retrovirus . 

11. The method of Claim 10 wherein Che Rev- 
dependent complex retrovirus is human immunodeficiency 

nn , A meth ° d of increasing the production of a 

llTZtlT' WhSrein P ° ly ^ tide i- encoded by a mRNA 

that contains one or more inhibitory/instability 
sequences, said method comprising the steps of: 

(a) providing a gene which encodes said 
mRNA; 

15 (t>) identi fying the inhibitory/instability 

sequences within said gene which 
encode said inhibitory/instability 
sequences within the coding region of 
said mRNA; 

20 (C) mutatin 9 said inhibitory/instability 

sequences within said gene by making 
multiple point mutations; 

(d) transfecting said mutated gene into a 
cell ; 

25 (S) culturin 3 said cell in a manner to 

cause expression of said mutated gene; 
(f) detecting the level of expression of 

said gene to determine that the effect 
of said inhibitory/instability 

30 sequences within the coding region of 

the mRNA has been reduced; 
<g> providing a host cell transfected with 
an expression vector containing said 
mutated gene; 

35 <h) culturin 9 said host cell to cause 

expression of said polypeptide; and 
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(i) recovering said polypeptide. 

13, A method of producing polypeptides, whose 
native production is impeded by the presence of an 
inhibitory/ instability sequence, comprising the steps of: 

(a) providing a host cell transfected with 
an expression vector containing a gene 
encoding said polypeptide, said gene 
having been mutated to decrease the 
effect of the inhibitory/ instability 
sequence; 

(b) culturing said host cell to cause 
expression of said polypeptide; and 

(c) recovering said polypeptide. 

15 14. The method of Claim 13 wheirein said host 

cell is prokaryotic. 
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15 . The method of Claim 13 wherein said host 
cell is eukaryotic. 

16. The method of Claims 13, 14 or 15 wherein 
said gene is a cDNA. 

17. The method of Claims 13, 14 or 15 wherein 
25 said gene is genomic. 

18 . An artificial nucleic acid construct 
comprising a gene wherein the expression of the native 
gene is impeded by the presence of inhibitory/ instability 
sequences in the mRNA encoded by said native gene, said 
gene having being mutated to decrease the effect of the 
inhibitory/ instability sequence. 

19 . The construct of Claim 18 wherein the amino 
35 acid sequence encoded by said mutated gene is the same as 
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the amino acid sequence encoded by the native gene. 

20. The construct of Claim 19 wherein said 
native gene is HIV-i gag. 

21. The construct of Claim 20 wherein said HIV- 
l gag gene has been mutated by the introduction of 
multiple point mutations between nucleotides 402 and 452 
536 and 583, 585 and 634, and 654 and 703. 

22. The construct of claim 19 wherein said 
native gene is HIV-i env. 

23. An assay kit for identifying 
inhibitory/instability sequences in a mRNA, comprising: 

(a) the nucleic acid construct of Claim 20 
or 21; and 

(b) a detection system for detecting the 
level of expression of said gene in 
said nucleic acid construct. 

24. The kit of Claim 23 wherein said detection 
system is an ELISA. 

25. An artificial nucleic acid construct 
comprising a gene mutated by the method of Claim 1 or 2. 

26. A vector comprising the nucleic acid 
construct of Claim 25. 

27. a transformed host cell comprising the 
artificial nucleic acid construct of Claim 25. 

28. A vector comprising the nucleic acid 
construct of Claim 18 or 19. 
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29. A transformed host cell comprising the 
artificial nucleic acid construct of Claim 18 or 19. 

30. A transformed host cell of Claim 29 wherein 
said cell is selected from the group consisting of 

5 eukaryotes and prokaryotes. 

31. The host cell of Claim 30 wherein said cell 
is a human cell. 

10 32. The host cell of Claim 30 wherein said cell 

is a Chinese Hamster Ovary cell . 

33. The host cell of Claim 30 wherein said cell 
is E. coli. 

34. The construct of Claim 20 wherein said HIV- 
1 gag gene has been mutated by the introduction of 
multiple point mutations between nucleotides 402 and 452, 
536 and 583, 585 and 634, 654 and 703, 871 and 915, 1105 
and 1139, 1140 and 1175 and 1321 and 1364. 

35. The construct of Claim 34 wherein said HIV- 
1 gag gene is p3 7Ml-10D. 

25 36. The construct of Claim 20 wherein said HIV- 

1 ga-ST gene has been mutated by the introduction of 
multiple point mutations between nucleotides 402 and 452, 
536 and 583, 585 and 634, 654 and 703, 871 and 915, 1105 
and 1139, 1140 and 1175, 1321 and 13 64, 1416 and 1466, 

30 1470 and 1520, 1527 and 1574, and 1823 and 1879. 

37. The construct of Claim 36 wherein said HIV- 
1 <J a ?J gene is p55Ml-l3P0. 

JJ 38. A vaccine composition for inducing immunity 
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in a mammal against HIV infection comprising a 
pharmaceutical^ acceptable medium and further comprising 
a therapeutically effective amount of a nucleic acid 
construct capable of producing HIV gag protein in the 
absence of any HIV regulatory protein in a cell in vivo. 

w . r . ^ A vaccine composition according to claim 38 
wherein said mammal is a human. 

wh _. 40 ^ A VaCCinS com P°sition according to claim 38 
wherein said regulatory protein is HIV-l Rev. 

wherein * C ° mp ° Sition according to claim 38 

wherein said construct is selected from the group 

consisting of the construct of claim 20, 21, 34 35 36 
and 37. A - ^. J6, 
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ln , 0 , • meth ° d f ° r induc ing immunity against HIV 

infection m a mammal which comprises administering to a 
—l a therapeutically effective amount of a vaccine 
composition comprising a nucleic acid construct capable of 

producing HIV gaq nrotPin -ir, *-k~ i_ 

a y«*y protein m the absence of any HIV 

regulatory protein in a cell in vivo. 

mammal i ^ I A aCCOrdin * to c1 *™ 42 wherein said 

m ammal is a human. 
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- ^ ' A aCCOrdin 5 to claim 42 wherein said 

regulatory protein is HIV-l Rev. 

rmc , 45 * A method a ^ording to claim 42 wherein said 

construct is selected from the group consistina of the 
construct of claim 20, 21, 34, 35, 36. and 37." 
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1 """srec c^^c ^mccr tgatctgtgg ATeaMca n 

n cxtcccxgax tscc^cx acacaccagg gcca gggaxc agaxaxccac Mgg ATmTw ^T 

Kl ^ACTAC CAOXT^CC AGAGCAAGTA GAAG AGGCCA JtMMCM - 
21 I CTAXGAGCCA GCAXGCCAXG «««« J)aa acMcr ATTAGTGTGG AAGTTTGA CA CCCT^" 
281 ATTTCGTCAC AXGGCCCGAG AGCXGCAXCC GG AGTACTAC AAAGACTGCT GACATCG AGC 
351 GGACXTXCCG CXGGGGACTT TCCAGGGAGG XGXG GCCXGG GCGGGACTGG GGAGT GGCGA GCCC^ 
<21 CCXACAXATA AGCAGCTGCX X^ccXst 

« TCTCXGGCTA ACTAGGGAAC CCACTGCXTA A^CTCAAXA ^ .7=^7 

561 TGCCCGTCTG TXGXGTGACT CXGGXAACXA GAGATC CCXC AGACCCXTTT AGT CAGTGTG GAAAATrrrr 

d «1 ^ f /**!> I 'mu i <»»- — ^ ■ — 



631 AGCAGXGGCG CCCGAACAGG^GACXTGAAAG CGAAAGXAAA GCCAGAGGAG AXCXCXCGAC GCAGCACXCG 



BssHII (711) 

* 1 Q I*' «»r Vtl L.||«., o ,01 yal uUii/UpArJ™ 




1157 ACATCAGGCCATATCACCTAGAACTTTia 

■ 9 leuAsnAI aTrpV a n. ysVa | Va , q, uG| uLy( . A| aPheSa , ProG| uVa| 



— ' uLyoni aKneser ProG) uva 1 

i3og yA ' * Thf PfoGI nA «PL«uAsnThrMai LauAtnThr Val a yGI yH 

' 9 UPf0Ar 9GI ySerAspl I eAI «ffl yThr Th f Ser Thr LeuGI n 

is37 ■ . •Krovi Gl yGI ul I «TyrLy 6 ArgTrpl I el I efuQ yL 

138 > ISa^^S!!??^ 

■ ■ " roThrS °' 1 1 , f uAspl I eArgGl nGl yP roLysGI uProPheArgAs 

fig. II B 
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1613 CTATGTAGACCGGTTCTATAAAACTCTAAGAGCTGAGCA^^ 
i€3 * Pl"yrValAspArgPheTyrLysTh r LeuArgAI aGl uGl nAI aSer Gt nGI uVa I LysAsnT rpMs t Thr Gi uThr 

1589 TTGTTGGTCCAAAATGCGAACCCAGATTGTAAGACCATCCTGAAGGCTCTCGGCCCAGCGGCTACACTAG^ 
189» LeuLeuVal GI nAsn AI aAsnProAspCysLysThr I I eLeuLysAI aLeuGI yProAl aAl aThr LeuGI uGI uM 

stop (1818) Xba! (189a)! 

1765 TGATGACAGCATGTCAGGGAGTAGGAGGACCCGGCCATAAGGCAAGAGTTTTGTAGGGRTCCACT^ 
214 ► etfcbt Thr Al aCysGi nGI yVal GI yGI yProGI yHi sLysAI aArgVal Lou 



1841 


Apal 

CGAGGGGGGG 


(1848) 

CCCGGXACCT 


TTAAGACCAA 


TGACTTACAA 


GGGAGCTGTA 


GATCTTAGCC 


ACUTITTAAA 


1911 


AGAAAAGGGG 


GGACTGGAAG 


GGCTAATTCA 


CTCCCAAAGA 


AGACAAGATA 


TCCTTGATCT 


GTGGATCTAC 


1981' 


CACACACAAG 


GCTACTTCCC 


TGATTGGCAG 


AACTACACAC 


CAGGGCCAGG 


GGTCAGATAT 


CCACTGACCT 


2051 


TTGGATGGTG 


CTACAAGCTA 


GTACCAGTTG 


AGCCAGATAA 


GGTAGAAGAG 


GCCAATAAAG 


GAGAGAACAC 


2121 


CAGCTTGTTA 


CACCCTGTGA 


GCCTGCATGG 


AATGGATGAC 


CCTGAGAGAG 


AAGTGTTAGA 


GTGGAGGTTT 


2191 


GACAGCCGCC 


TAGCATTTCA 


TCACGTGGCC 


CGAGAGCTGC 


ATCCGGAGTA 


CTTCAAGAAC 


TGCTGACATC 


2261 


GAGCTTGCTA 


CAAGGGACTT 


TCCGCTGGGG 


ACTTTCCAGG 


GAGGCGTGGC 


CTGGGCGGGA 


CTGGGGAGTG 


2331 


GCGAGCCCTC 


AGATGCTGCA 


TATAAGCAGC 


TGCTTTTTGC 


CTGTACTGGG 


TCTCTCTGGT 


TAGACCAGAT 


2401 


CTGAGCCTGG 


GAGCTCTCTG 


GCTAACTAGG 


GAACCCACTG 


CTTAAGCCTC 


AATAAAGCTT 


GCCTTGAGTG 


2471 


CTTCAAGTAG 


TGTGTGCCCG 


TCTGTTGTGT 


GACTCTGGTA 


ACTAGAGATC 


CCTCAGACCC 


TTTTAGTCAG 


2541 


TGTGGAAAAT 


CTCTAGCACC 


CCCCAGGAGG 


XAGAGGTTGC 


AGTGAGCCAA 


GATCGCGCCA 


CTGCATTCCA 



2611 GCCTGGGCAA GAAAACAAGA CTGTCIAAAA TAATAATAAT AAGTTAAGGG TATTAAATAT ATTTATACAT 

2681 GGAGGTCATA AAAATATATA TATTTGGGCT GGGCGCAGTG GCTCACACCT GCGCCCGGCC CTTTGGGAGG 

2751 CCGAG GCAGG TGGATCACCT GAGTTTGGGA GTTCCAGACC AGCCTGACCA ACATGGAGAA ACCCCTTCTC 

2821 TGTGTATTTT TAGTAGATTT TATTTTATGT GTATTTTATT CACAGGTATT TCTGGAAAAC TGAAACTGTT 

2891 TTTCCTCTAC TCTGATACCA CAAGAATCAT CAGCACAGAG GAAGACTTCT GTGATCAAAT GTGGTGGGAG 

2961 AGGGAGGTTT TCACCAGCAC ATGAGCAGTC AGTTCTGCCG CAGACTCGGC GGGTGTCCTT CGGTTCAGTT 

3031 CCAACACCGC CTGCCTGGAG AGAGGTCAGA CCACAGGGTG AGGGCTCAGT CCCCAAGACA TAAACACCCA 

3101 AGACATAAAC ACCCAACAGG TCCACCCCGC CTGCTGCCCA GGCAGAGCCG ATTCACCAAG ACGGGAATTA 

3171 GGATAGAGAA AGAGTAAGTC ACACAGAGCC GGCTGTGCGG GAGAACGGAG TTCTATTATG ACTCAAATCA 

3241 GTCTCCCCAA GCATTCGGGG ATCAGAGTTT TTAAGGATAA CTTAGTGTGT AGGGGGCCAG TGAGTTGGAG 

3311 ATGAAAGCGT AGGGAGTCGA AGGTGTCCTT TTGCGCCGAG TCAGTTCCTG GGTGGGGGCC ACAAGATCGG 

3381 ATGAGCCAGT TTATCAATCC GGGGGTGCCA GCTGATCCAT GGAGTGCAGG GTCTGCAAAA TATCTCAAGC 

3451 ACTGATTGAT CTTAGGTTTT ACAATAGTGA TGTTACCCCA GGAACAATTT GGGGAAGGTC AGAATCTTGT 

3521 AGCCTGTAGC TGCATGACTC CTAAACCATA ATTTCTTTTT TGTTTTTTTT TTTTTATTTT TGAGACAGGG 

_ Pstl (3639) 

3591 TCTCACTCTG TCACCXAGGC TGGAGTGCAG TGGTGCAATC ACAGCTCACT GCAGCCCCTA GAGCGGCCGC 

3661 CACCGCGGTG GAGCTCCAAT TCGCCCTATA GTGAGTCGTA TTACAATTCA CTGGCCGTCG TTTTACAACG 

3731 TCGTGACTGG GAAAACCCTG GCGTTACCCA ACTTAATCGC CTTGCAGCAC ATCCCCCTTT CGCCAGCTGG 

3801 CGTAATAGCG AAGAGGCCCG CACCGATCGC CCTTCCCAAC AGTTGCGCAG CCTGAATGGC GAATGGCGCG 

3871 AAATTGTAAA CGTTAATATT TTGTTAAAAT TCGCGTTAAA TTTTTGTTAA ATCAGCTCAT TTTTTAACCA 

3941 ATAGGCCGAA ATCGGCAAAA TCCCTTATAA ATCAAAAGAA TAGACCGAGA TAGGGTTGAG TGTTGCTCCA 

4011 GTTTGGAACA AGAGTCCACT ATTAAAGAAC GTGGACTCCA ACGTCAAAGG GCGAAAAACC GTCTATCAGG 

4081 GCGATGGCCC ACTACGTGAA CCATCACCCT AATCAAGTTT TTTGGGGTCG AGGTGCCGTA AAGCACTAAA 

4151 TCGGAACCCT AAAGGGAGCC CCCGATTTAG AGCTTGACGG GGAAAGCCGG CGAACGTGGC GAGAAAGGAA 

4221 GGGAAGAAAG CGAAAGGAGC GGGCGCTAGG GCGCTGGCAA GTGTAGCGGT CACGCTGCGC GTAACCACCA 

4291 CACCCGCCGC GCTTAATGCG CCGCTACAGG GCGCGTCCCA GGTGGCACTT TTCGGGGAAA TGTGCGCGGA 

4361 ACCCCTATTT GTTTATTTTT CTAAATACAT TCAAATATGT ATCCGCTCAT GAGACAATAA CCCTGATAAA 
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. 501 TgSSS TGAGTATTCA ACATTTCCGT GTCGCCCTTA TTCCCTI^T 

4571 TTGGGTGCAC GAGTGGctS S^S? gffff*** CTGGTGAAAG TAAAAGATGC TGAAGATCAG 
4 641 AAGAACGTTT 'TCCAATGATG AG^SJSa GCGGTAAGAT CCTTGAGAGT 1TTCGCCCCG 

4711 CGGGCAAGAG CAACTCGGTC gSSSS ATGTGGCGCG GTATTATCCC GTATTGACGC 

4781 GAAAAGCATC TTACGGATGG S^T^ G AATGACTTGG TTGAGTACTC ACCAGTCACA 

4851 CTGCGGCCAA CTTAotS^ AC^Sra GCAGTGCTGC CATAACCATG AGTGAIAACA 

4921 GGATCATGTA ACTCGCCTTG aSgSSS GGAGCIAACC GCTTTTTTGC ACAACATGGG 

4991 ACCACGATGC CTCTAGCAAT gSSSSS ScS ™=CAAACGA CGAGCGTGAC 

5061 CCCGGCAACA ATTAATAGAC TGGATGGAGG ^WTTAACTGG CGAACTAC1T ACTCTAGCTT 

5131 GGCTGGCTGG TTTATTGCTG SSSSgI ^^f! TGCAGGACCA CTTCTGCGCT CGGCCCTTCC 
5201 GGGCCAGATG GTAAGCCCTC rrrwJSS? ^SBf® 51 * 3 * 5 CGTGGGTCTC GCGGTATCAT TGCAGCACTG 
5271 GAAATAGaS StcSSg SSSSS £S£££ """OW* ATGGATgSc 

5341 ATATAXACTT TAGATTCATT i££5£S ££££££ fOOTGGTAA CTGTCAGACC AAGTTTACTC 
5411 AATCTCATGA CCAAAATCCC iWSS SSiES AAAAGGATCT AGGTGAAGAT CCTTTTTGAT 
5481 AAGGATCTTC TTGAGATCCT S?S ACTGAGCGTC AGACCCCGTA GAAAAGATCA 

5551 AGCGGTGGTT 1CTTTGCCGG ATCAAGAGCT SZZ™*** ACMA ****C CACCGCTACC 

5621 CAGATACCAA ATACTGTCCT TcSSSS ££££££ •f'TCCGAAGG TAACTGGCTT CAGCAGAGCG 
5691 CTACATACCT CGCTC^CTA ATCcSSc GCCACCACTT CAAGAACTCT GTAGCACCGC 

5761 GTTGGACTCA AGACGATAGT SSSSf ^CASTCCC 5AIAAGTCGT GTCTTACCGG 

5831 CCCAGCITGG AGCGAACgS SSSSi ^«^CT^A CGGGGGGTTC GTGCACACAG 

5901 TTCCCGAAGG GAGAAAGGCG ££^2^ XACAGCGTGA GCTATGAGAA AGCGCCACGC 

5971 GCTTCCAGGG GGAAACGCCT SSS5 ACAGGAGAGC GCACGAGGGA 

6041 TTTTTGTGAT GCTCGTCAGG GSGGCGsS GGGTTTCGCC ACCTCTGACT TGAGCGTCGA 

6111 TGGCCTTTTG CTGGCCTTTT SSS£S ^EIF* 5 ™* CGCG «CTTT TTACGG1TCC 

6181 TACCGCCTTT GAGTGAGCTG SSESS SS***** «"«GIGG ATAACCGIAT 

6251 GAAGCGGAAG AGCGCCCAAT SSScG CCTCTCCCCG AGTGAGCGAG 
6321 ACGACAGGTT TCCCGACTGG AAAGCGGGCa CGC5TTGGCC GATTCATTAA TGCAGCTGGC 

6391 GGCACCCCAG GCTTTACACT TTATGCTTrr CGCAMTAAT GTGAGTTAGC TCACTCATIA 

6461 CACACAGGAA ACAGCTATGA ^G^TOGAA TTGTGAGCGG ATAACAATTT 

Psil (6533) CCATGATTAC GCCAAGCTCG GAATTAACCC TCACTAAAGG GAACAAAAGC 

JS SSK SSSc cSg^S TCAGGCTGCC Ferrer agcggctgcc 

6671 GAGCGGTAAG ACTGCGAGAG S2SS2S I^GCCAAGG GGTGCGCACA 

6741 CAGCCTCACA GGGGGTTTAT SSSS! GGGTTTATCA CAGTGCACCC TGACAGTCGT 

6811 ACCCTTACAA TCATTCCATT SSSSf ^ ^ 5 CTCAGCCT » CAGGGGGTXT ATCACAGTGC 
6881 TGATCAGAGG TGTGTTCCCA GAGGGGAAAA rl^t^l CTCTACTGTG CCTAACTTCT AAGTTAAATT 
6951 CACCTGGGTC TTGGAATGTG tcSSag^ SSSSS ^E TCAGT *CT*TCGCAT TTCAGGCCTC 
7021 ACACAAGATA ACCAAGACAC SxSSS ^^^5! AC CTCAGTTG GATCTCCACA GGTCACAGTG 
7091 AACTGCCATG TCGGAGGTGC SSSSS GGCCGCCCTC CACGTGCACA TCGCCGGAGG 

7161 GCTTCCAGCC ATCCACCTGA TGAA^A^ SfS!?" 31 TG6AGGGAGG GACCAGCGCA 

TGAACAGAAC CTAGGGAAAG CCCCAGTTCT ACTTACACCA GGAAAGGC 

FT 3 - D 
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